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Description 

BACKGROUND OF THE INVENTION 
. Field of the Invention 
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Mart, .nd .ragmen* polypeptide * »» SS^S." "-*■ «"** We """"I 

my. comprising the polynucleotide " d , ,n «^ 1 ^^nrrlconW. ano use « them as well i™*"* 1 

2 Brief Description of the Background Art 

[00021 Colter. bacteha are use, in pacing 

^^^^ 

ducing bacterium, and many amino acids are P' 0 ^.^ 
giutamic add which is usefulasaseasonmgforumam(deto 

^OoT^^ 

mutants) whteh have a mutated metabolic ^^^J^^SL it needs. In the biosynthesis of L- 
various metabolic regulatory systems so as not to pro « >*™ un dersuch regulation as prevent.ng 

lysine, for example, a microorganism belonging to the ^Z^o^e loZ^ the activity of a biosynthesis enzyme 
the excessive production by concerted inhib rtI on by W 65. 849-859 (1 969)). The biosyn- 

common to lysine, threonine and meth.on.ne, i.e.. an aspartokinase (J. a «ocn . ^ tQ bjosyn . 

tresis of argLe is controlled by repressing the ^^J^^S- ^considered that these metabolic 
thesize an excessive amount of aramine ( M ' cro *^^^ 

regulatory mechanisms are deregulated in ammo ac J^JJJ^ an d analogues of the above-desenbed 

ulated in mutants producing nucleic acids, vitamins, saccharides, organic acios a 

regulating the growth and metabolism of these mi ° ro °^ Sm J a _ fcum ATCC 130 32 is reported and it is known that 

the nucleotide sequences of most genes have not bee " c 'f^ n d o ~° several microorganisms, such as Escherichia 
[00071 in recent years, the full nucleotide sequence of the ge n ^^^^ gnee 277, 1453-62 (1997); Nature. 

39* 537-544 (1998). Nature, 387. 5-105 (1997)). Based ^J^,,,,. seqU ences of known genes have 

bTi^^^ 

ncously or detecting mutations, using DNA chips. to a solid support, have been 

gene or a partial nucleic acid fragment in genom.c DNA other than a gene ■ *x m tubeiculosiSi 

^ped^t^^ 

Mycobacterium bovis used in BCG vaccines, and the like (Science, 27a. oou 
9&. 12833-38 (1999); Science, 284: 1520-23 (1999)). 
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SUMMARY OF THE INVENTION 

[0009] An object of the present invention is to provide a polynucleotide and a polypeptide derived from a microor- 
ganism of coryneform bacteria which are industrially useful, sequence information of the polynucleotide and the 
polypeptide, a method for analyzing the microorganism, an apparatus and a system for use in the analysis, and a 
method for breeding the microorganism. 

[0010] The present invention provides a polynucleotide and an oligonucleotide derived from a microorganism be- 
longing to coryneform bacteria, oligonucleotide arrays to which the polynucleotides and the oligonucleotides are fixed, 
a polypeptide encoded by the polynucleotide, an antibody which recognizes the polypeptide, polypeptide arrays to 
which the polypeptides or the antibodies are fixed, a computer readable recording medium in which the nucleotide 
sequences of the polynucleotide and the oligonucleotide and the amino acid sequence of the polypeptide have been 
recorded, and a system based on the computer using the recording medium as well as a method of using the polynu- 
cleotide and/or polypeptide sequence information to make comparisons. 

BRIEF DESCRIPTION OF THE DRAWING 

[001 1] Fig. 1 is a map showing the positions of typical genes on the genome of Corynebacterium gtutamicum ATCC 
13032. 

[0012] Fig. 2 is eleclrophoresis showing the resulls of proteome analyses using proteins derived from (A) Coryne- 
bacterium gtutamicum ATCC 13032, (B) FERM BP-7134, and (C) FERM BP-158. 

[0013] Fig. 3 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

[0014] Fig. 4 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[0015] This application is based on Japanese applications No. Hei. 11-377484 filed on December 16, 1999, No. 
2000-159162 filed on April 7, 2000 and No. 2000-280988 filed on August 3, 2000, the entire contents of which are 
incorporated hereinto by reference. 

[0016] From the viewpoint that the determination of the full nucleotide sequence of Corynebacterium gtutamicum 
would make it possible to specify gene regions which had not been previously identified, to determine the function of 
an unknown gene derived from the microorganism through comparison with nucleotide sequences of known genes 
and amino acid sequences of known genes, and to obtain a useful mutant based on the presumption of the metabolic 
regulatory mechanism of a useful product by the microorganism, the inventors conducted intensive studies and, as a 
result, found that the complete genome sequence of Corynebacterium gtutamicum can be determined by applying the 
whole genome shotgun method. 

[0017] Specifically, the present invention relates to the following (1 ) to (65): 
(1 ) A method for at least one of the following: 

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium, 

(B) measuring an expression amount of a gene derived from a coryneform bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 

(D) analyzing expression patterns of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a coryneform bacterium, 

said method comprising: 

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected 
from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any 
one of SEQ ID NOS:1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 
stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of 
the first or second polynucleotides, 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 
labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 



EP 1 108 790 A2 



10 



is 



20 



25 



30 



35 



40 



45 



50 



55 



^ , herein ' f0f example ' ,he at least ^ polynucleotides can be at least two of the first polynu- 
tuTJ?f h S ' ? f 3St ^ the SeC0 " d P o| y nud eotides. at least two of the third polynucleotides, or at least 
two of the first, second and third polynucleotides. 

Co/^LaSlTJn" 9 t0 i 1) ' ? erei " COr * neiom bacterium «• » microorganism belonging to the genus 
Corynebactenum, the genus Brevibacterium, or the genus Microbacterium 

from Jhe^muntr 0 ?" 9 ^ thS microoraanism belonging to the genus Corynebactenum is selected 

umm^™££^ °°0«-»c« W *«,h«u* QvM,^ ^efcacter/- 

wT-S?!!!^ ' Co '^ ,ef ' acte ' IU ' n ^"^am/nooenes, and Corynebactenum ammoniagenes. 
M^Zt^Tf ( f 1 L Wherein I" 6 P o| y nucleotide deriv «d from a coryneform bacterium, the polynuce- 
o Z bZnlr ,7 COr y neform bacterium °< the polynucleotide to be examined is a gene relating 

(ejlpo^ 

o! ia ;T e r/ 0lynUC,e0tiCleS Se ' eCt6d ,r ° m the 9mUp COnsistin 9 of firet Polynucleotides comprising the nucle- 
^X^ZZTrT bV r y ° ne ° f SE ° ' D NOS:1 10 3501 • second Poiynucleotides which hybridue 
t^ousb^ZTT Unde : stri "9 ent ^mons, and third polynucleotides comprising 1 0 to 200 con- 
tinuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 

at letr^of 0 ^! 0 ' TT' ^ ' 0aSt tW ° P 0, y nuclc °«dcs can bo at least two of the first polynucleotides, 

iss ttr P o; e s t rs uc,eotides> at ieast ^ ° f the third - - — ^ * ^ ^ 

i B ^ U l S^^t-?T'?S fl ° f thS nucleotide ^Puences represented by SEQ ID NOS:2 to 3431 or 

a polynucleotide which hybndizes with the polynucleotide under stringent conditions 

NOS-3502 to SS" enC ° di r 9 3 p0lypeptide havi "9 a "V one <* the amino acid sequences represented by SEQ ID 
H m A !ot^,H •,° r 8 po * nucleotide which "ybridizes therewith under stringent conditions. 
nucfM^n T° h " Pre , Sent in the 5 ' UpStream or 3 ' ^stream of a polynucleotide comprising the 
In Sequence of a "y one of SEQ ID NOS:2 to 3431 in a whole polynucleotide comprising the nucleot de 

miTszEr 1 by seq ,D , No:i - and has an ° f re9u,atina an expressfon of r P Tnudeo r d r 

anv one om^ m C ° mPn f ' 09 1 ,° 10 200 continuous base * * the nucleotide sequence of the polynucleotide of 

Zp^S 

(12) A recombinant DNA comprising the polynucleotide of any one of (8) to (11) 

recovenng the polypeptide from the medium. 

nuc^l^f l ran t Sf0mant ° f i2 3) 3 medium t0 pr ° dUCe and acc ^ulate at least one of an amino acid, a 
fT'"' 8 SaCOhande - an acid - analogues thereof in the medium, and 

mZZZ 1° S r° u ° amin ° aCid ' th ° nUC,CiC aCid " thc vitamin - saccharide, the organic acid, 
^«nd analogues thereof from the medium. «*. «..u, 

2 tos^i^^^ 6 enC ° ded by 8 po, y nucle °tide comprising the nucleotide sequence selected from SEQ ID NOS: 



J po| yP e P tide comprising the amino acid sequence selected from SEQ ID NOS-3502 to 6931 
(18) The polypeptide according to (16) or (17). wherein at least one amino acid is deleted, replaced, inserted or 
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polypeptide. P VW " tle ol (16) or < ,7 >- and havl "9 ■» >*«y which is substantially the same as that ol the 

a solid support adhered thereto. 
(22) A polypeptide array, comprising: 

aso'd^ 

cfrinlr^ « ™ or a la r gel structure mo, derived from a 

ml comn t0r f 9C ;K d0ViCC f ° r St ' 0aSt tom P° rari 'y coring tho input information; 
<rv) an output device that shows a screening or analog r»u« obtained by the comparator, 
(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS-1 to 3501 t«mat 

^r^r m ~~^„^ « -» «" - • «■»« ^rucure — denved _ a 

™TJ . , ' ^ sequence or target structure mow information; 
i ) a co^'f^ T* 1 * '" *' " > " 5 ' """P"""* "Wns the input inloimaiion; 

de^e foHc ™, * qUenC< ' lar9e " 8lru0 * ,re ™" ">'»™«ion, records by the data storage 

.he^r-r^ 

a " ° utp,rt **° *" <"">«« • •ooontoB or analyzing result obtained by tho comparator. 
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target sequence or target structure motif information. 
^"oTo^toraMe^ 

2 ,„ 3501 with the target ..uoleotw "r^l^rnce^hrtT^ldent with or analogs to M po* 
SL^TZe- that .»ow S a tunction obtained by «. oor.pa.ato.. 

following: 

fmm SEQ ID NOS:2 to 3501 , function in- 

!^™.T^~S»o. — - SEQ . N0~ ,o 3*01 .» 

the target nucleotide sequence information; and . c|eotide having the target nucleotide sequence 

from SEQ ID NOS:2 to 3501 . 

a coqnriom. baotarium. oomprMng M Mow,ng. 

SrLJlrom a conform bactanum. cocnprteiag the Conowing. 

. - ^ fr« m qfo in NOS'3502 to 7001 , function 

S'^scsss srs — - - seq id nos:3602 ,o 7001 

with the target amino acid sequence informal. on; and seqU ence which is coincident with or 

7001. 

,3„T*a s*e,om accord » an, « - (.3,. WW - «• — ' « ~»— " — ' 
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ganism of the genus Cory nebacteri urn, the genus Brevibacterium, or the genus Microbacterium. 

(32) The method according to any one of (24), (26), (28) and (30). wherein a coryneform bacterium is a microor- 
ganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

(33) The system according to (31 ). wherein the microorganism belonging to the genus Corynebacterium \s selected 
5 from the group consisting of Corynebacterium gtutamicum, Corynebacterium acetoacidophilum, Corynebacterium 

acetogtutamicum, corynebacterium callunae, corynebacterium hercutis, Corynebacterium lilium, Corynebacterium 
me!assecola f Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(34) The method according to (32), wherein the microorganism belongingto the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophitum, Corynebacterium 

to acetogtutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacter'h 

urn metassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(35) A recording medium or storage device which is readable by a computer in which at least one nucleotide 
sequence information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide se- 
quence is recorded, and is usable in the system of (23) or (27) or the method of (24) or (28). 

is (36) A recording medium or storage device which is readable by a computer in which at least one amino acid 

sequence information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid 
sequence is recorded, and is usable in the system of (25) or (29) or the method of (26) or (30). 

(37) The recording medium or storage device according to 

(35) or (36), which is a computer readable recording medium selected from the group consisting of a floppy disc, 
20 a hard disc, a magnetic tape : a random access memory (RAM), a read only memory (ROM), a magneto-optic disc 

(MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM and DVD-RW. 

(38) A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the 
Val residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform 
bacterium is replaced with an amino acid residue other than a Val residue. 

25 (39) a polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino 

acid sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

(40) The polypeptide according to (38) or (39), wherein the Val residue at the 59th position is replaced with an Ala 
residue. 

(41) A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro 
30 residue at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform 

bacterium is replaced with an amino acid residue other than a Pro residue. 

(42) A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino 
acid sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

(43) The polypeptide according to (41 ) or (42), wherein the Pro residue at the 458th position is replaced with a Ser 
35 residue. 

(44) The polypeptide according to any one of (38) to (43), which is derived from Corynebacterium glutamicum. 

(45) A DNA encoding the polypeptide of any one of (38) to (44). 

(46) A recombinant DNA comprising the DNA of (45). 

(47) A transformant comprising the recombinant DNA of (46). 

40 (48) A transformant comprising in its chromosome the DNA of (45). 

(49) The transformant according to (47) or (48), which is derived from a coryneform bacterium. 

(50) The transformant according to (49), which is derived from Corynebacterium glutamicum. 

(51 ) A method for producing L-lysine, comprising: 

4 * culturing the transformant of any one of (47) to (50) in a medium to produce and accumulate L-lysine in the 

medium, and 

recovering the L-lysine from the culture. 



(52) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:1 to 3431 , comprising the following: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nudeic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
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bacterium obtained in (Hi). 

(53) The method according to (52) , wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
a signal transmission pathway. 

5 (54) The method according to (52), wherein the mutation point is a mutation point relating to a useful mutation 

which improves or stabilizes the productivity. 

(55) A method for breading a corynefomn bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:1 to 3431 , comprising: 

10 (j) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 

rium which has been subjected to mutation breeding so as to produce at least one compound selected from 
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431; 
(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

is (jjj) deleting a mutation point from a coryneform bacterium having the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 

(56) The method according to (55), wherein the gene is a gene encoding an enzyme in a biosynlhelic pathway or 
20 a signal transmission pathway. 

(57) The method according to (55), wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

(58) A method for breeding a corynefomn bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:2 to 3431 , comprising the following: 

25 

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 
quence information represented by SEQ ID NOS:2 to 3431 ; 

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity; 

30 (Hi) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bac- 
terium which have been transformed with the gene obtained in (iii). 

(59) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
35 SEQ ID NOS:2 to 3431 , comprising the following: 

(i) arranging a function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431 ; 

(ii) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission 
pathway; 

40 (jjj) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium 

in combination with information relating known biosynthesis pathway or signal transmission pathway of a co- 
ryneform bacterium; 

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either 
45 strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or 

weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 

(60) A coryneform bacterium, bred by the method of any one of (52) to (59). 

(61) The coryneform bacterium according to (60), which is a microorganism belonging to the genus Corynebac- 
50 terium, the genus Brevibacterium, or the genus Microbacterium. 

(62) The coryneform bacterium according to (61 ), wherein the microorganism belonging to the genus Corynebac- 
tcrium is selected from the group consisting of Coryncbactcrium gtutamicum, Coryncbactcrium acctoaddophilum, 
Corynebacterium acetoglutamicum, Corynebactehum caliunae, Corynebacterium herculis, Corynebacterium Hl- 
ium, Corynebacterium meiassecoia, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

55 (63) A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a 

saccharide, an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of (60) to (62) in a medium to produce and accumulate at least 
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one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof; 

recovering the compound from the culture. 

(64) The method according to (63), wherein the compound is L-lysine. 

(65) A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the 
following: 

(i) preparing 

a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
jected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(Hi) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; 

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase 
to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

As used herein, the term "proteome", which is a coined word by combining "protein" with "genome", refers to 
a method for examining of a gene at the polypeptide level. 

(66) The method according to (65). wherein the coryneform bacterium is a microorganism belonging to the genus 
Corynebacterium, the genus Brevibacterium, or the genus Microbactehum. 

(67) The method according to (66), wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogfutamicum, Corynebacterium callunae, corynebacterium hercutis, Corynebacterium lilium Corynebacterium 
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

(68) A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382). 

[0018] The present invention will be described below in more detail, based on the determination of the full nucleotide 
sequence of coryneform bacteria. 

1 . Determination of full nucleotide sequence of coryneform bacteria 

[0019] The term "coryneform bacteria" as used herein means a microorganism belonging to the genus Corynebac- 
terium, the genus Brevibacterium or the genus Microbacterium as defined in Bergeys Manual of Determinative Bacte- 
riology, 5:599(1974). 

[0020] Examples include Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium 
callunae, Corynebacterium glutamicum, Corynebacterium hercutis, Corynebacterium lilium, Corynebacterium metas- 
secola, Corynebacterium thermoaminogenes, Brevibacterium saccharolyticum, Brevibacterium immariophilum, Brevi- 
bacterium roseum, Brevibacterium thiogenitaiis, Microbacterium ammoniaphilum, and the like. 

[0021 ] Specific examples include Corynebacterium acetoacidophilum ATCC 1 3870, Corynebacterium acetoglulami- 
cum ATCC 15806, Corynebacterium callunae ATCC 15991, Corynebacterium glutamicum ATCC 13032, Corynebac- 
terium glutamicum ATCC 13060, Corynebacterium glutamicum ATCC 13826 (prior genus and species: Brevibacterium 
ftavum, or Corynebacterium lactofermentum), Corynebacterium glutamicum ATCC 14020 (prior genus and species: 
Brevibacterium divaricatum), Corynebacterium glutamicum ATCC 13869 (prior genus and species: Brevibacterium 
lactofermentum), Corynebacterium hercutis ATCC 13868, Corynebacterium lilium ATCC 15990, Corynebacterium 
melassecola ATCC 17965, Corynebacterium thermoaminogenes FERM 9244, Brevibacterium saccharolyticum ATCC 
14066, Brevibacterium immariophilum ATCC 14068, Brevibacterium roseum ATCC 13825, Brevibacterium thiogenitaiis 
ATCC 19240, Microbacterium ammoniaphilum ATCC 15354, and the like. 
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(1) Preparation of genome DNA of coryneform bacteria 



[00221 Coryneform bacteria can be cultured by a conventional method. pf fi cie nt 
0023 Any of a natural medium and a synthetic medium can be used, so iong as .t » a "^^^^ 
5 culturing of the mteroorganism, and it contains a carbon source, a nitrogen source, an .norgamc salt, and the l.ke wh.ch 
can be assimilated by the microorganism. <A„ii„ a „t«i» ^ sodium 

[0024] in Corynebacterium gUmfcm for example, a BY medium (7 g/l meat extract 10 grf 
chloride, 5 g/l yeast extract, pH 7.2) containing 1% of glycine and the like can be used. The cultunng ,s earned out 

10 WwlZtoecLpleton of the culture, the cells are recovered from the culture by centrifugation. The resuming 

cells are washed with a washing solution. ■„.-,„ t~ h.,Hmrhinririe mmol/ 

[0026] Examples of the washing solution include STE buffer (1 0.3% sucrose, 25 mmoW I Tns hydrochlonde, 25 mmol/ 
I ethylenediaminetetraacetic acid (hereinafter referred to as "EDTA"), pH 8.0), and the like. obtainina ae . 

r00271 Genome DNA can be obtained from the washed cells according to a conventional method for obtamingge 

« l Z?OH™Z™X the cel. wall of the ce..s using a lysozyme and a surfactant ^^J^^^ 
and the like using a phenol solution and a phenol/chloroform solution, and then preciprtating the genome DNA win 
ethanol or the like. Specifically, the following method can be illustrated. .uawnn c to 

mm The washed cells are suspended in a washing solution containing 5 to 20 mg/l lysozyme. After shaking 5 to 
Lo^DS "s addedto lyse the cells In usual, shaking is gently performed at 25 to 40'C for 30 minutes to 2 hours. After 

so shaking, the suspension is maintained at 60 to 70»C for 5 to 15 minutes for the ^lys.s. neutra|ized Dheno , is 

[0029] Afterthe lysis, the suspension is cooled to ordinary temperature, and 5 to 20 ml of Tns-neutrahzed phenol 
added thereto, followed by gently shaking at room temperature for 1 5 to 45 minutes aaueous | aV er 

[0030] Aftershaking, centrifugation (15.000 x g, 20 minutes, 20«C) is earned out to tl ^^^^^r 
0031 Aftorporforming extraction with phonol/chloroform and extraction w,th chloroform ^> ^E^^S 

25 3 mo,/, sodium acetate solution (pH 5.2) and isopropano. are added to the aqueous laye ' * <^ 

times volume, of the aqueous layer, respectively, followed by gentry sfmng to J*J 9 of the 

[0032] The genome DNA is dissolved again in a buffer containing 0.01 to 0.04 mg/ml ^^^T^^^ 
buffer TE buffer (10 mmol/l Tris hydrochloride, 1 mol/l EDTA, pH 8.0) can be used. After dissolving, ^^u'tarn 
slttoni"^ 

30 and chloroform as in the above case. ^—^.tnte k washed with 

[0033] Afterthe extraction, isopropanol precipitation is carried out and the resulting ™A pwtete 18 washed wrth 
70% ethanol, followed by air drying, and then dissolved in TE buffer to obtain a genome DNA solution. 

(2) Production of shotgun library 

[0034] A method for produce a genome DNA library using the genome DN A of ^^"^^"J i£E2£ 
heabove(1)includeamethoddescrir*KiinM^^^ 

referred to as 'Molecular Cloning, 2nd ed."). In particular, the following method can be exemplified to prepare a genome 
DNA library appropriately usable in determining the full nucleotide sequence by ^ such ^ TE 

f00351 To 0 01 mq of the genome DNA of the coryneform bacteria prepared in the above (1 ) , a buffer, sucn as i t 
Luff ePor me .ike, IV^Xo give a total volume of 0.4 ml. Then, the genome DNA is digested into IJ^^! * 
1 0 will I sonicator (Yamato Powersonic Model 50). The treatment with the sonicator ,s performed at an output of 

loosr?^ 

45 ?0037]" ke The blunt-ended genome fragments are fractionated by agarose gel or potyacry.amide gel electrophoresis 
^To^gT^ 

10 mmol/. magnesium acetate, 1 mmol/l EDTA, 0.1% SDS) or the like, is added, fo.lowed by shaking at 25 to 40 C 
" [O^r^'esX DNA e.uate is treated w«h phenoL/ch.oroform and then precipitated with ethanol to obtain a 

Biotech) or the like, usfng T4 ligase (manufactured by Takara Shuzo) or the like. The bgebon can be carried out by 
ss allowing a mixture to stand at 10 to 20°C for 20 to 50 hours. 

[0041] The resulting ligation product is precipitated with ethanol and dssolved in 5 to 20 ul of TE ^buffer 

[0042 Escherichto coli is transformed in accordance with a conventual method usmg 0_5 to £ £ Id f the g*on 

solution Examples of the transformation method include the electroporation method us.ng ELECTRO MAX DHIOB 
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cZitnTardelri^H 0109 ' 68 ; EsChe " chia co,i ^ e.ectroporation method can be carried out under the 
conditions as described in the manufacturer's instructions. 

n?»Ti J* 16 trans,onTied Escherichia coB is spread on a suitable selection medium containing aaar for examole LB 

Side ^ZScoZn 9 ! 0 ! O e l 00 f m9/, rf""" (LB m6diUm (1 ° 9/1 baCt0t ^ ton - 5 bC2S^iJZ5™ 
[Owi? ' ™ e transfor^r^n ? * a9ar > When pUC18 is U6ed as the Zoning vector, and cultured therein 
theTinsfermlnt h I ^ M C °'° nieS fomied on the P ,ate mediu ™- this step, it is possible to select 

™tssi5!£rs^sir; -~ dna - — ^ - ^ ~ 

(3) Production of cosmid library 

resWcLn^n^mr" 16 k NA ( o 1 m9) °' thS c °W*o m bacteria prepared in the above (1) is partially digested with a 
restnction enzyme, such as SauSAI or the like, and then ultracentrifuged (26,000 rpm 18 hours 20-C) under a To .to 

fragmint^ln^h?^ "SET " ,0 * C ° Smid VOCt ° r havin 9 a cohcsivc cnd which can bo ligatod to tho 

t.ons 9 PerC ° Sl < manufac turedbyStratagene) in accordance with the manufactured instru^ 

S^mXZ^IVZT, b JKT d T°. a packaging extract which <*" be b * a meth ° d 

^(ma'^^^ 

S. transformed Escherichia coii is spread on an LB plate medium containing ampicil.in, and cuttured 

E! ^ £ ans ! ormant can be opined colonies formed on the plate medium. 

SSL^^^l^S b f "T? an eXP6riment ° f (4) deSCribed be,ow - Also - tne cu,tu ^ Nation can 

(4) Determination of nucleotide sequence 
(4-1) Preparation of template 

[0056] Specifically, the template can be prepared as follows 

bvG?N™?n?n ne d f ed i f ? m thC Wh0 ' e 98n0me Shotgun ,ibra, V is ""ciliated by using a replicator (manufactured 

S, r"£ l0 n o^Z e *TL° n * USin9 3 C ° Py P ' ate ^ Tokken). into each weH of a 

bJ^5^?S!Sr^T V T B ' OSyStems) to which 0025 ml P" «•» of a PGR reaction solution has 
P^^SSS^STiS^S tnm TT7i £ Takara ShUZ0) T1,en - PCR fe Carried out in aocordance with the 
Uster^r^ (1 " 8 » USi " g °~*"P PCR °™ (manufactured by PE 
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[0059] The excessive primers and nucleotides are eliminated using a kit for purifying a PCR product, and the product 
is used as the template in the sequencing reaction. 

[0060] It is also possible to determine the nucleotide sequence using a double-stranded DNA plasmid as a template. 
[0061] The double-stranded DNA plasmid used as the template can be obtained by the following method. 

5 [0062] The clone derived from the whole genome shotgun library is inoculated into each well of a 24- or 96-well plate 
to which 1 .5 ml per well of a 2 x YT medium (1 6 g/l bactotrypton, 10 g/l yeast extract, 5 g/l sodium chloride, pH 7.0) 
containing 0.05 mg/ml ampicillin has been added, followed by culturing under shaking at 37°C overnight 
[0063] The double-stranded DNA plasmid can be prepared from the culture solution using an automatic plasmid 
preparing machine KURABO PI-50 (manufactured by Kurabo Industries), a multiscreen (manufactured by Millipore) 

10 or the like, according to each protocol. 

[0064] To purify the plasmid, Biomek 2000 manufactured by Beckman Coulter and the like can be used. 

[0065] The resulting purified double-stranded DNA plasmid is dissolved in water to give a concentration of about 0.1 

mg/ml. Then, it can be used as the template in sequencing. 

*5 (4-2) Sequencing reaction 

[0066] The sequencing reaction can be carried out according to a commercially available sequence kit or the like. A 
specific method is exemplified below. 

[0067] To 6 jxl of a solution or ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured 
20 by PE Biosystems), 1 to 2 pmol of an M13 regular direction primer (M 13-21) or an M13 reverse direction primer 
(M13REV) (DNA Research, 5: 1-9 (1998)) and 50 to 200 ng of the template prepared in the above (4-1) (the PCR 
product or plasmid) to give 10 u,l of a sequencing reaction solution. 

[0068] A dye terminator sequencing reaction (35 to 55 cycles) is carried out using this reaction solution andGeneAmp 
PCR System 9700 (manufactured by PE Biosystems) or the like. The cycle parameter can be determined in accordance 
2 * with a commercially available kit, for example, the manufacture's instructions attached with ABI PRISM Big Dye Ter- 
minator Cycle Sequencing Ready Reaction Kit. 

[0069] The sample can be purified using a commercially available product, such as Multi Screen HV plate (manu- 
factured by Millipore) or the like, according to the manufacture's instructions. 

[0070] The thus purified reaction product is precipitated with ethanol, dried and then used for the analysis. The dried 
30 reaction product can be stored in the dark at -30°C and the stored reaction product can be used at any time. 

[0071] The dried reaction product can be analyzed using a commercially available sequencer and an analyzer ac- 
cording to the manufacture's instructions. 

[0072] Examples of the commercially available sequencer include ABI PRISM 377 DNA Sequencer (manufactured 
by PE Biosystems). Example of the analyzer include ABI PRISM 3700 DNA Analyzer (manufactured by PE Biosystems). 

35 

(5) Assembly 

[0073] A software, such as phred (The University of Washington) or the like, can be used as base call for use in 
analyzing the sequence information obtained in the above (4). A software, such as Cross Jvlatch (The University of 
*o Washington) or SPS Cross_Match (manufactured by Southwest Parallel Software) or the like, can be used to mask 
the vector sequence information. 

[0074] For the assembly, a software, such as phrap (The University of Washington), SPS phrap (manufactured by 
Southwest Parallel Software) or the like, can be used. 

[0075] In the above, analysis and output of the results thereof, a computer such as UNIX, PC, Macintosh, and the 
45 like can be used. 

[0076] Contig obtained by the assembly can be analyzed using a graphical editor such as consed (The University 
of Washington) or the like. 

[0077] It is also possible to perform a series of the operations from the base call to the assembly in a lump using a 
script phredPhrap attached to the consed. 
50 [0078] As used herein, software will be understood to also be referred to as a comparator. 

(6) Determination of nucleotide sequence in gap part 

[0079] Each of the cosmids in the cosmid library constructed in the above (3) is prepared in the same manner as in 
55 the preparation of the double-stranded DNA plasmid described in the above (4-1 ). The nucleotide sequence at the end 
of the insert fragment of the cosmid is determined using a commercially available kit, such as ABI PRISM BigDye 
Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems) according to the manufacture's 
instructions. 
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ch ai n linka^e^^^ (5) " C ° inCident With ^ Se " Uence Thus . 

out. Furthemwe the i^Itea^c^lS th „ T" C ° nt ' 9S 3re darified ' and mutual a,i 9 nment is 

* CorynebacteZl Xam^m Zcc Zt 1 PhyS,Ca ' maf>S to m3P the COSmids and the conti 9 s - «*e of 

moan -n, »' uram,cum Ar CC 13032, a physical map of AfoA Gen. Genef 252 255-265 i-ioor* r.»n ha ...^ 

K„ g T r tn T ence ln we "*» — * «— * - Wto^S" £ ISZSlKi. 

« K S^^n^' I"" nUC,e ° tide SeqUen ° e ° f the 9ap part can be dete ™"* d 

the end s^s oftte 1 S^conZTi' C '° ne 'T" 9 *" 93P ^ " aVa " ab,e ' primere to 
sequencing Is perfoled ^ToZTSnTZ^ ° NA fra9m6nt * » part iS ""P""*- ^ 

shotgun method in which the i^SS? 9 " nP ^ ° NA f ragment 88 3 template or b * the 

- S he ,;rr de DNA ,ra9ment is de ™ 

^StSS^Zlr?' T ^ Synth6SiZed USi " 9 AUTOFINISH function and 
method to im Pro ve thel^ 

™s£Tm™ ° f nUC ' e0tide SSqUenCe ° f mic ™*™ genome DNA using the nucieotide sequence represented 

mined using the nucleotide sequence repr^enJd b SEQ ID NO 1 ^ 38 dete,mmed ^ «" also be 
quence having a homology of BO^^SX^^S. !i ' 6 po,ynuc,eotide ha ™9 a nucleotide se- 
invention is within the scooe olttZ orient ? ""cleot.de sequence represented by SEQ ID NO:1 of the present 
a homology 2 s£i Im^m^e ^c^ZT ^ " po * nuc,eotide havi "9 ■ "^.eotide sequence'havinq 
* polynucleotide in whfc ™ fc„ SISiS^S^^ represented by SEQ ID NO:1 of the present invention- is a 
oligonucleotide compost o con ^ DNA Ca " be dete ™ ined «-"0 as a primer an 

1 . for example accord^ « p?£ " J? ° 50 nucleot,des « the nucleotide sequence represented by SEQ ID No- 
mination oTthe ^T'T D ? 38 3 temP ' ate - A paniCUlart » preferred * 
the interval of aboutSS to ^^S^^ST^T T 9 Se<,UenCes Which are P osi «°"° d - 
<o selected from DNAs^dfn S^nTS^a^S^ ^«9onuc.eotide having a nucleotide sequence 
otide in which the full -? !. metabolic pathway is particularly preferred. The polynucle- 
-.udes^^ « be d «*™* d «-"0 the olhoSJLSL 
teria. Such a 'polynucleotide^ I m ^n^SZ^f ™ T S microor 9 anism belonging to coryneform bac- 

belonging to the genus SSS^m 
* ryneoLLum gStamtoZ ' ° Preferat " y 3 P 0 '*™^* constituting a chromosome DNA of Co- 

2. Identification of ORF (open reading frame) and expression regulatory fragment and determination of the function of 

TI^T^VrfZ^T SeqUenCe d3ta ° f 9en ° me d6riVed fr ° m COf y" efo - ba <*-« ^ermined in 
thus dotormfned ORF^n FZZSZ™ ^ ^ ^ FUrtHemi0re - the - ^ 

^-^5^^ " T t C ' e °A ide SeqUenCS ° f mRNA Whfch be transla ted as an 

roooii Th « quence 10 mature to a Protein. A region of the DNA coding for the ORF of mRNA is also called obp 

Th 6 expression ^SSSS f™^ S ^ ' express "'" of or another sequence Ijgated operatably thereto, 

e^onotese,^::^^^^^ 
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enhancer, a silencer, a ribosome-binding sequence, a transcriptional termination sequence, and the like. In coryneform 
bacteria, an EMF is usually present in an intergenic segment (a fragment positioned between two genes; about 10 to 
200 nucleotides in length). Accordingly, an EMF is frequently present in an intergenic segment of 10 nucleotides or 
longer. It is also possible to determine or discover the presence of an EMF by using known EMF sequences as a target 
5 sequence or a target structural motif (or a target motif) using an appropriate software or comparator, such as FASTA 
(Proc. Natl. Acad. Sci. USA, 85: 2444-48 (1988)), BLAST (J. Mol. Biol., 275:403-410 (1990)) or the like. Also, it can 
be identified and evaluated using a known EMF-capturing vector (for example, pKK232-8; manufactured by Amersham 
Pharmacia Biotech). 

[0092] The term "target sequence" is used herein to refer to a nucleotide sequence composed of 6 or more nucle- 
10 otides, an amino acid sequence composed of 2 or more amino acids, or a nucleotide sequence encoding this amino 
acid sequence composed of 2 or more amino acids. A longer target sequence appears at random in a data base at 
the lower possibility. The target sequence is preferably about 1 0 to 1 00 amino acid residues or about 30 to 300 nucle- 
otide residues. 

[0093] The term "target structural motif" or "target motif" is used herein to refer to a sequence or a combination of 
is sequences selected optionally and reasonably. Such a motif is selected on the basis of the threedimensional structure 
formed by the folding of a polypeptide by means known to one of ordinary skill in the art. Various motives are known. 
[0094] Examples of the target motif of a polypeptide include, but are not limited to, an enzyme activity site, a protein- 
protein interaction site, a signal sequence, and the like. Examples of the target motif of a nucleic acid include a promoter 
sequence, a transcriptional regulatory factor binding sequence, a hair pin structure, and the like. 
20 [0095] Examples of highly useful EMF include a high-expression promoter, an inducible-expression promoter, and 
the like. Such an EMF can be obtained by positionally determining the nucleotide sequence of a gene which is known 
or expected as achieving high expression (for example, ribosomal RNA gene: GenBank Accession No. M16175 or 
Z46753) or a gene showing a desired induction pattern (for example, isocrtrate lyase gene induced by acetic acid: 
Japanese Published Unexamined Patent Application No. 56782/93) via the alignment with the full genome nucleotide 
25 sequence determined in the above item 1 , and isolating the genome fragment in the upstream part (usually 200 to 500 
nucleotides from the translation initiation site). It is also possible to obtain a highly useful EMF by selecting an EMF 
showing a high expression efficiency or a desired induction pattern from among promoters captured by the EMF- 
capturing vector as described above. 

[0096] The ORF can be identified by extracting characteristics common to individual ORFs, constructing a general 
30 model based on these characteristics, and measuring the conformity of the subject sequence with the model. In the 
identification, a software, such as GeneMark (Nuc. Acids. Res., 22: 4756-67 (1994): manufactured by GenePro)), 
GeneMark.hmm (manufactured by GenePro), GeneHacker {Protein, Nucleic Acid and Enzyme, 42 3001-07 (1997)), 
Glimmer (Nuc. Acids. Res. t 2&. 544-548 (1998): manufactured by The Institute of Genomic Research), or the like, can 
be used. In using the software, the default (initial setting) parameters are usually used, though the parameters can be 
35 optionally changed. 

[0097] In the above-described comparisons, a computer, such as UNIX, PC, Macintosh, or the like, can be used. 
[0098] Examples of the ORF determined by the method of the present invention include ORFs having the nucleotide 
sequences represented by SEQ ID NOS:2 to 3501 present in the genome of Corynebacterium glutamicum as repre- 
sented by SEQ ID NO:1 . In these ORFs, polypeptides having the amino acid sequences represented by SEQ ID NOS: 

40 3502 to 7001 are encoded. 

[0099] The function of an ORF can be determined by comparing the identified amino acid sequence of the ORF with 
known homologous sequences using a homology searching software or comparator, such as BLAST, FAST, Smith & 
Waterman (Mem. Enzym., 164: 765 (1 988)) or the like on an amino acid data base, such as Swith-Prot, PIR, GenBank- 
nr-aa, GenPept constituted by protein-encoding domains derived from GenBank data base, OWL or the like. 

45 [0100] Furthermore, by the homology searching, the identity and similarity with the amino acid sequences of known 
proteins can also be analyzed. 

[0101] With respect or the term "identity" used herein, where two polypeptides each having 10 amino acids are 
different in the positions of 3 amino acids, these polypeptides have an identity of 70% with each other. In case wherein 
one of the different 3 amino acids is analogue (for example, leucine and isoleucine), these polypeptides have a similarity 
so of 80%. 

[0102] As a specific example, Table 1 shows the registration numbers in known data bases of sequences which are 
judged as having the highest similarity with the nucleotide sequence of the ORF derived from Corynebacterium glutami- 
cum ATCC 13032, genes of these sequences, functions of these genes, and identities thereof compared with known 
amino acid translation sequences. 
55 [0103] Thus, a great number of novel genes derived from coryneform bacteria can be identified by determining the 
full nucleotide sequence of the genome derived from coryneform bacterium by the means of the present invention. 
Moreover, the function of the proteins encoded by these genes can be determined. Since coryneform bacteria are 
industrially highly useful microorganisms, many of the identified genes are industrially useful. 
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.» II 1 ° RF °" m ™ r »^»<»« «*g • polypeptide synthesizer tied on the above s^l™ Mon™. 

^"^Z^crSS^^^ 8 '°'>™*»«* the nuclide 

stringent conditions ' " P 0 *™"*"** <*** Iodizes with the polynucleotide under 

20 T01 1 01 qSSi^ . d set > uence b V degeneracy or a gene code. 

SEQ ID No! % >Z J ? 8 P o| y nuc,eotide comprising the nucleotide sequence represented by any one of 

[01?1 A po.y^^^ WhiCh hybridiZeS With the P^nucleolide underwent cTdlns 

Lion 1 f^SSS^tSS^ hi ; Str : n9ent ^ nditi ° nS iS 8 P 0 *"*** ^ainedby colony hybridi- 
nuclootideloquonc^ f^: , 5?S^^?S B r n °« I" USin9 ' ** * ^ the 

» Ummmtb 9 ti r ^^l^^ , ^. th f' ab ° VO SpOC,fic oxam P ,0S i^ludc a polynucleotide which can bo 

P^ared^roS 

Sonina S3 5? nd,2at ' 0n " n be camed out in accordance with known methods described in. for example Molecular 

with the polynucleotide under stringent conditions NOS.3502 to 6931 and a polynucleot.de which hybridizes 

'p"r^ invention incudes a po^nudeotide which is present in the 5" 

2 to 3431 in a <°NOS: 

encoding the a^ov^ 

sequence, a transcrptiona, te JS^i^'^Si ZT*"' 8 * 

c^tlLI^ 10 the f ° Ve PCR *•*"» unique indudes an oligonu- 

sequence oMhe ORF an an aZ^n ^1^^, ^ ,°J 1 ° t0 2 °° continuous nucleotides in the nucleotide 
to the oligonudeoude VS^S^S^ 90nUC, ? 0t,de «"P"*Q * sequence which is complementary 
sequence of 10 to 200 232^2^2??^ "T^™* COm P risin 9 a *°<V°™ which is the same as a 
to 3431 and an StaSnuSSl^ nUCle ° tlC,eS ° f the nucle °t'de sequence represented by any one of SEQ ID NOS 1 

primer and an antisense orim^r th» ohBuaJi ™ ™ NOS1 to 3431 • Wnen the Pnmers are used as a sense 

J—nuc^r^^ -P— ^ and the 

ST. ^"^CoufnulleSorZ" 0 '' I" 0 ',"?" ° l *''"""<»«* -"P** a sequence „„*. ,„. 

1 EXamp,eS ° f thG ana '° 90US ''^nucleotides include analogous oligonucleotides in which a phosphodiester 
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bond in an oligonucleotide is converted to a phosphorothioate bond, analogous oligonucleotides in which a phosphodi- 
esterbond in an oligonucleotide is convertedto an N3'-P5' phosphoamidate bond, analogous oligonucleotides in which 
ribose and a phosphodiesterbond in an oligonucleotide is converted to a peptide nucleic acid bond, analogous oligo- 
nucleotides in which uracil in an oligonucleotide is replaced with C-5 propynyluracil, analogous oligonucleotides in 
which uracil in an oligonucleotide is replaced with C-5 thiazoluracil, analogous oligonucleotides in which cytosine in 
an oligonucleotide is replaced with C-5 propynylcytosine, analogous oligonucleotides in which cytosine in an oligonu- 
cleotide is replaced with phenoxazine-modified cytosine, analogous oligonucleotides in which ribose in an oligonucle- 
otide is replaced with 2'-0-propylribose, analogous oligonucleotides in which ribose in an oligonucleotide is replaced 
with 2'-methoxyethoxyribose, and the like (Ceil Engineering, 16: 1463 (1997)). 

[0119] The above oligonucleotides and analogous oligonucleotides of the present invention can be used as probes 
for hybridization and antisense nucleic acids described below in addition to as primers. 

[0120] Examples of a primer for the antisense nucleic acid techniques known in the art include an oligonucleotide 
which hybridizes the oligonucleotide of the present invention under stringent conditions and has an activity regulating 
expression of the polypeptide encoded by the polynucleotide, in addition to the above oligonucleotide. 

3. Determination of isozymes 

[0121] Many mutants of coryneform bacteria which are useful in the production of useful substances, such as amino 
acids, nucleic acids, vitamins, saccharides, organic acids, and Ihe like, are obtained by the present invention. 
[0122] However, since the gene sequence data of the microorganism has been, to date, insufficient, useful mutants 
have been obtained by mutagenic techniques using a mutagen, such as nitrosoguanidine (NTG) or the like. 
[0123] Although genes can be mutated randomly by the mutagenic method using the above-described mutagen, all 
genes encoding respective isozymes having similar properties relating to the metabolism of intermediates cannot be 
mutated. In the mutagenic method using a mutagen, genes arc mutated randomly. Accordingly, harmful mutations 
worsening culture characteristics, such as delay in growth, accelerated foaming, and the like, might be imparted at a 
great frequency, in a random manner. 

[0124] However, if gene sequence information is available, such as is provided by the present invention, it is possible 
to mutate all of the genes encoding target isozymes. In this case, harmful mutations may be avoided and the target 
mutation can be incorporated. 

[0125] Namely, an accurate number and sequence information of the target isozymes in coryneform bacteria can be 
obtained based on the ORF data obtained in the above item 2. By using the sequence information, all of the target 
isozyme genes can be mutated into genes having the desired properties by, for example, the site-specific mutagenesis 
method described in Molecular Cloning, 2nd ed. to obtain useful mutants having elevated productivity of useful sub- 
stances. 

4. Clarification or determination of biosynthesis pathway and signal transmission pathway 

[0126] Attempts have been made to elucidate biosynthesis pathways and signal transmission pathways in a number 
of organisms, and many findings have been reported. However, there are many unknown aspects of coryneform bac- 
teria since a number of genes have not been identified so far. 
[0127] These unknown points can be clarified by the following method. 

[0128] The functional information of ORF derived from coryneform bacteria as identified by the method of above item 
2 is arranged. The term "arranged" means that the ORF is classified based on the biosynthesis pathway of a substance 
or the signal transmission pathway to which the ORF belongs using known information according to the functional 
information. Next, the arranged ORF sequence information is compared with enzymes on the biosynthesis pathways 
or signal transmission pathways of other known organisms. The resulting information is combined with known data on 
coryneform bacteria. Thus, the biosynthesis pathways and signal transmission pathways in corynerorm bacteria, which 
have been unknown so far, can be determined. 

[0129] As a result that these pathways which have been unknown or unclear hitherto are clarified, a useful mutant 
for producing a target useful substance can be efficiently obtained. 

[0130] When the thus clarified pathway is judged as important in the synthesis of a useful product, a useful mutant 
can be obtained by selecting a mutant wherein this pathway has been strengthened. Also, when the thus clarified 
pathway : js judged as not important in the biosynthesis of the target useful product, a useful mutant can be obtained 
by selecting a mutant wherein the utilization frequency of this pathway is lowered. 

5. Clarification or determination of useful mutation point 

[0131] Many useful mutants of coryneform bacteria which are suitable for the production of useful substances, such 
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as amino acids nucleic acids, vitamins, saccharides, organic acids, and the like, have been obtained. However it is 
hardly known which mutation point is imparted to a gene to improve the productivity 

onhfLn^P nMr^fh 0 " P ° intscontained in Production strains can be identified by comparing desired sequences 
n ,h!^ «je Production strains obtained from coryneform bacteria by the mutagenic technique with the 

ine methods of the above items 1 and 2 and analyzing them 

muJSion DoT on'th!f NB 7J ati ° n P0 ' ntS contributin 9 t0 the P roducti0 " can be easily specified from among these 
Zsms th. 't . kn ° Wn ,nformation relati "9 10 the ^tabolic pathways, the metabolic regulatory mech- 
anisms, the structure activity correlation of enzymes, and the like 

cTnte inZuceTnto^iTH T^T ^ b8Sed °" k "° Wn data ' the mutation P° inte «« ^entified 

wTefhe"^ 

whether or not any positive effect can be achieved on the production. 

Jtll 5] ■ F of Xample ' by com P arin 9 tne nucleotide sequence of homoserine dehydrogenase gene horn of a Ivsine- 
Stram ° f C °^ etecte "'" n 9lutamicum (App,. Microti*. Biotecnncl., 3^J^mlV^Z- 

Tnventn a'rSioS^ 

wTweniified J 2" in Z replacement in which valine at the 59-positJon is replaced with a.anine (Val59A.a) 

memod ^ Lh k by ,ntroducin 9 this ™ tati °n '*° the ATCC 13032 strain by the gene replacemen 

onys^ie indi ° ateS that thlS mUtati ° n iS a " effeCtiVe mUtati ° n contribu «"9 "> *■ production 

EL™?** by COmparing the nucleotide sequence of pyruvate carboxylase gene pyc of the B-6 strain with the 

Tl t J2T h" ^ reP,aCed ^ S6rine ( Pro458Ser > ™°* id «"tified. A strain obtained by introducing^ muta on 
■TtaEET dUC,n9 f ra ! n ° f N °- 58 < FERM BP 7134 ) * Corynebacterium giutamicumtree of this mutatioTshows 
muZT JT!? pr ° dUCt,vi1y in «W*»n with the No. 58 strain, which indicates that this mutation is an cffocZ 
mutation contributing to the production of lysine cnocirvc 

tationTela^o" SS^SS?" n ^ * 9 lu c°so-6-phos P hate dehydrogenase was specified as an effective mu- 
» f!C?5 *T by det6Cting 9'"cose-6-phosphate dehydrogenase gene zwf of the B-6 strain, 
the Q^LosmonT^ « ^ S,ne - productivit y of Corynebacterium giutamicum was improved by replacing the base at 
^rf? T of 1 as P artok,nase 9ene lysC of the Corynebacterium glutamicum ATCC 13032 genome with cytosine 

* ™th^ S °' 88 a "°* errnetnod to examine whether or not the identified mutation point is an effective mutation there 
Sriin £ St J T mUtati0 " P ° SSeSSed by the 'y sine -P rod ^ng strain is returned to the sequence of a 

5J ^L^SET? T ° d Wh6ther ° r n0t * h9S H ne9atiVe inflUence ° n the »* ine Productivity.yor 
w^urrS to a ZSZ r6pi *° e ™ nt mutation Val59A,a P ossess * d by hom of the lysine-producing B-6 strain 
^TSSEJJSIT am,n ° acid seauence > the 'y sine Productivity was lowered in comparison with the B-6 strain, 
roi^ii mIZ B mUtat,on 18 an effective muta «on contributing to the production of lysine. 

SSLSS^ 15 POi " tS ^ be T' 6 effident,y and com P rehen «^'y ^racted by combining, if needed, the 
uima array analysis or proteome analysis described below. 

6. Method of breeding industrially advantageous production strain 

pmductio'n ofth^TT! ^If' 0 * * C ° nStmCt P roduction strains - whicn «re used industrially in the fermentation 
aZ lZ Zl £ substances, such as amino acids, nucleic acids, vitamins, saccharides, organic acids 

orl^krand Z^r^"^ "* brSedin9 °" mUtageneSiS USing ™ ta 9 er *. ^ °* NTG 

t£S DNA r Sn!r'iT ny 65 ° f imPr ° Ved Pr ° dUCli0n SlrainS haVe Deen made lhrou 9 h *e °f recom- 

taZd bv « teChn,q " es , ,n breedmg ' however ' most of the P are "t production strains to be improved are mutants ob- 
SI rTT^ mutagenic procedure (W. Leuchtenberger, Amino Acids - Technical Production and Use In 

he°m P 4ef oSe") SeCO " d editi0 "' V ° 1- 6 ' Pr0dUCtS ° f Prima,y metabolism VCH Vertagsgesellschaft mbH. Wein- 

suitrLm!'^? mUtagcncSiS ; mothods havc ,ar 9°'y contributed to the progress of the fermentation industry, they 

mani nlLT P * rand ° m introduction of mutations into every part of the chromosome kince 

byTe^dor^ ^^T^ 3 Sin9 ' e Chromosome each « me a *™ - -proved, a production strain obtained 

consumotion ZT^L T f " 9 * 9enera " y inferi ° r in pr ° perties (for examp,e - showin 9 P°°r 9'°wth, delayed 
consumptton of sacchandes, and poor resistance to stresses such as temperature and oxygen) to a wild type strain 

n*ed JESSIE 11 ''? M faMin9 10 eStabMSh " SUffident ' y 6leVa ' ed P~ d "CtMty^bein g friuent^nS: 
nated with m,scellaneous bactera, requiring troublesome procedures in culture maintenance, and the like and. in its 
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turn, elevating the production cost in practice. In addition, the improvement in the productivity is based on random 
mutations and thus the mechanism thereof is unclear. Therefore, it is very difficult to plan a rational breeding strategy 
for the subsequent improvement in the productivity. 

[0144] According to the present invention, effective mutation points contributing to the production can be efficiently 
specified from among many mutation points accumulated in the chromosome of a production strain which has been 
bred from coryneform bacteria and, therefore, a novel breeding method of assembling these effective mutations in the 
coryneform bacteria can be established. Thus, a useful production strain can be reconstructed. It is also possible to 
construct a useful production strain from a wild type strain. 
[0145] Specifically, a useful mutant can be constructed in the following manner. 

[0146] One of the mutation points is incorporated into a wild type strain of coryneform bacteria. Then, it is examined 
whether or not a positive effect is established on the production. When a positive effect is obtained, the mutation point 
is saved. When no effect is obtained, the mutation point is removed. Subsequently, only a strain having the effective 
mutation point is used as the parent strain, and the same procedure is repeated. In general, the effectiveness of a 
mutation positioned upstream cannot be clearly evaluated in some cases when there is a rate-determining point in the 
downstream of a biosynthesis pathway. It is therefore preferred to successively evaluate mutation points upward from 
downstream. 

[0147] By reconstituting effective mutations by the method as described above in a wild type strain or a strain which 
has a high growth speed or the same ability to consume saccharides as the wild type strain, It is possible to construct 
an industrially advantageous strain which is free of troubles in the previous methods as described above and to conduct 
fermentation production using such strains within a short time or at a higher temperature. 

[0148] For example, a iysine-producing mutant B-6 {AppL Microbiol. Biotechnot., 32: 262-273 (1989)), which is ob- 
tained by multiple rounds of random mutagenesis from a wild type strain Corynebacterium giutamicum ATCC 13032, 
enables lysine fermentation to be performed at a temperature between 30 and 34°C but shows lowered growth and 
lysine productivity at a temperature exceeding 34°C. Therefore, the fermentation temperature should be maintained 
at 34°C or lower. In contrast thereto, the production strain described in the above item 5, which is obtained by recon- 
stituting effective mutations relating to lysine production, can achieve a productivity at 40 to 42°C equal or superior to 
the result obtained by culturing at 30 to 34°C. Therefore, this strain is industrially advantageous since it can save the 
load of cooling during the fermentation. 

[0149] When culture should be carried out at a high temperature exceeding 43°C, a production strain capable of 
conducting fermentation production at a high temperature exceeding 43°C can be obtained by reconstituting useful 
mutations in a microorganism belonging to the genus Corynebacterium which can grow at high temperature exceeding 
43°C. Examples of the microorganism capable of growing at a high temperature exceeding 43°C include Corynebac- 
terium thermoaminogenes, such as Corynebacterium thermoaminogenes FERM 9244, FERM 9245, FERM 9246 and 
FERM 9247. 

[0150] A strain having a further improved productivity of the target product can be obtained using the thus recon- 
structed strain as the parent strain and further breeding it using the conventional mutagenesis method, the gene am- 
plification method, the gene replacement method using the recombinant DNA technique, the transduction method or 
the cell fusion method. Accordingly, the microorganism of the present invention includes, but is not limited to, a mutant, 
a cell fusion strain, a transformant, a transductant or a recombinant strain constructed by using recombinant DNA 
techniques, so long as it is a producing strain obtained via the step of accumulating at least two effective mutations in 
a coryneform bacteria in the course of breeding. 

[0151] When a mutation point judged as being harmful to the growth or production is specified, on the other hand, 
it is examined whether or not the producing strain used at present contains the mutation point. When it has the mutation , 
it can be returned to the wild type gene and thus a further useful production strain can be bred. 
[0152] The breeding method as described above is applicable to microorganisms, other than coryneform bacteria, 
which have industrially advantageous properties (for example, microorganisms capable of quickly utilizing less expen- 
sive carbon sources, microorganisms capable of growing at higher temperatures). 

7. Production and utilization of polynucleotide array 

(1) Production of polynucleotide array 

[0153] A polynucleotide array can be produced using the polynucleotide or oligonucleotide of the present invention 
obtained in the above items 1 and 2. 

[0154] Examples include a polynucleotide array comprising a solid support to which at least one of a polynucleotide 
comprising the nucleotide sequence represented by SEQ ID NOS:2 to 3501 , a polynucleotide which hybridizes with 
the polynucleotide under stringent conditions, and a polynucleotide comprising 10 to 200 continuous nucleotides in 
the nucleotide sequence of the polynucleotide is adhered; and a polynucleotide array comprising a solid support to 
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at one S??n ^K!? 0,kle e " COdin9 * po| yP e * tide comprising the amino acid sequence represented by 
condlns and a n„, I S * 7 °°V * P°'>™ cle ° tide whic " hybridizes with the polynucleotide under stringer," 
ToS is adhered " * 2 °° C ° ntinU ° US ***** ,n the nUC,eotide se < uences ° f P°^- 

DNrLrotrv^nd'a^ 8 ° f ^ PreS6nt inVenti ° n inC ' Ude SUbStra,eS known in the art - such 38 ■ DNA chip, a 
f«™£S and a DNA macroarray, and the like, and comprises a solid support and plural polynucleotides or 
ragments thereof wh>ch are adhered to the surface of the solid support ynuceonaes or 

0 Si n«! 6S ° f , th * SO,id s f u PP° rt include 3 9las S plate, a nylon membrane, and the like. 

S J h ^S^^^T^ th T 60f adher6d l ° the SUrf3Ce ° f the solid su PP° rt can be to the 

IrtolZu* L k n PP " 9 * e 9eneral technique for preparing arrays. Namely, a method in which thev are 

TteTa^Sr T^T, "°" d SUPP °* fore ™^ to which a%olycation such as polyhys ne o nZ Hke 

tlZ^^lT^ S0 " d Pr0dUCt ™ ^ ^ 38 th6 SO ' id SUpP ° rt °< th P eUnuc,eotide^Ta"X 

otides of ^VSresemTnl^ ^nucleotides adhered to the solid support, the polynucleotides and o.igonucle- 
rnieoi presem lnv ention obtained in the above items 1 and 2 can be used 

toth^solL^ 

n,£ni ? PP 1 8 h ' 9h denS,ty> thOUgh a hi 9" fixation de nsity is not always necessary. 

K Vra p sC 0 ^ ^ G r M sJf7 v ^^ hi f °r? densiiy ' such 88 an arrayer rob ° i ° r ine nke - * — **«• 

rnifiii a? .u ( , Arrayer), and the commercially available product can be used. 

PhotoLoara'nhJ l2rH UC l° tid ! S °l ^ i™*™ be Resized directly on the solid support by the 

KS^JK^tL'? t k !, ** 20 " 24 < 1 " 9)) - thiS method ' a linker havi "9 a P~t^ive group 

Z iSdTatc w^hTaht t * 9 J I?" ' S firSt Sdhered t0 a SOHd SUpp0rt ' such 88 a slide 9 ,ass or the like. Then, ft 
aUTstfr, 1m f hr ° U9 3 maSk (a P h °t 0,i thograph mask) permeating light exclusively at a definite part of the 

procedur ^ES^E Z?T ? nUC ' e ° tide eXclusivel V al *» '^diated part. By repeating this 

tZ art! i2 XI 2? ? ' ? aV ' n9 8 deS ' red diffe ™t from each other can be synthesized^ respec- 

live parts. Usually, the oligonucleotides to be synthesized have a length of 1 0 to 30 nucleotides 

(2) Use of polynucleotide array 
J0162J The following procedures (a) and (b) can be carried out using the polynucleotide array prepared in the above 

J£KE^ baCteriUm mUtant 3nd ana ' ysiS ° f expression ™** - d -pression 

KnS^^^wi 60 "" 9 3 96ne derived from a mutant of coryneform bacteria or an examined gene to the followina 

tS^^ST" 1 ^ ° f thS 96ne Ca " te idenWied ° r the expression " and ^Z^Jn^l 

(i) producing a polynucleotide array by the method of the above (1)* 

a mu^o^ ° n . the P0,ynuc,e0tide arra * together wrth the labeled gene derivedfrom 

conimo^ ^ nef0rm baCtenUm US ' ng the P°^^eotide array produced in the above (i) under hybridization 
(Hi) detecting the hybridization; and 
(iv) analysing the hybridisation data. 

S^Z^JSZl IT f ° ,C0rynef0mi bacteria ° r examined gene indude a gene renting to 

m amm ° aCidS ' nUC ' eiC addS ' Vitamins ' strides, organic acids, and ana- 
[0165] The method will bo described in detail. 

otid^irratlsX "^.eotide pofymorphism (SNP) in a human region of 2,300 kb has been identified using polynucle- 

T^^^?B%VS^ S 1 I*} i" th6 meth ° d ° f identifying SNP and methoidesenbed 
thelS^f L ■ Na l L ^ ^ USA ' 9& 12833 - 38 (1999 > ; Science ' 284 - 1 «0-23 (1999). and 

~Ce e S^ in the above < 1 > and a "«Wc ™« molecule (ON A, RNA) derivedfrom 

an armno^cir! n 8 , ? hybnd,7at,on, a mutation point of a useful mutant, which is useful in producing 

an ammo acd, a nucleic acid, a vrtamin. a saccharide, an organic acid, or the like can be identified and the gene 
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expression amount and the expression profile thereof can be analy7ed. 

[0167] The nucleic acid molecule (DIM A, RNA) derived from the coryneform bacteria can be obtained according to 
the general method described in Molecular Cloning, 2nd ed. or the like. mRNA derived from Corynebacterium glutamic 
cum can also be obtained by the method of Bormann et al. {Molecular Microbiology, 6: 31 7-326 (1 992)) or the like. 
[0168] Although ribosomal RNA (rRNA) is usually obtained in large excess in addition to the target mRNA, the anal- 
ysis is not seriously disturbed thereby. 

[0169] The resulting nucleic acid molecule derived from coryneform bacteria is labeled. Labeling can be carried out 
according to a method using a fluorescent dye, a method using a radioisotope or the like. 

[0170] Specific examples include a labeling method in which psoralen-biotin is crosslinked with RNA extracted from 
a microorganism and, after hybridization reaction, a fluorescent dye having streptoavidin bound thereto is bound to 
the biotin moiety (Nat. BiotechnoL, 16: 45-48 (1998)); a labeling method in which a reverse transcription reaction is 
carried out using RNA extracted from a microorganism as a template and random primers as primers, and dUTP having 
a fluorescent dye (for example, Cy3, Cy5) (manufactured by Amersham Pharmacia Biotech) is incorporated into cDNA 
(Proc. Natl. Acad. ScL USA, 96: 12833-38 (1999)); and the like. 

[0171] The labeling specificity can be improved by replacing the random primers by sequences complementary to 
the 3*-end of ORF (J. BacterioL, 181: 6425-40 (1999)). 

[0172] In the hybridization method, the hybridization and subsequent washing can be carried out by the general 
method {Nat. Bioctechnol., 14: 1675-80 (1996), or the like). 

[0173] Subsequently, the hybridization intensity is measured depending on the hybridizaLion amounl of the nucleic 
acid molecule used in the labeling. Thus, the mutation point can be identified and the expression amount of the gene 
can be calculated. 

[0174] The hybridization intensity can be measured by visualizing the fluorescent signal, radioactivity, luminescence 
dose, and the like, using a laser confocal microscope, a CCD camera, a radiation imaging device (for example, STORM 
manufactured by Amersham Pharmacia Biotech), and the like, and then quantifying the thus visualized data. 
[0175] A polynucleotide array on a solid support can also be analyzed and quantified using a commercially available 
apparatus, such as GMS418 Array Scanner (manufactured by Takara Shuzo) or the like. 

[0176] The gene expression amount can be analyzed using a commercially available software (for example, ImaGene 
manufactured by Takara Shuzo; Array Gauge manufactured by Fuji Photo Film; ImageQuant manufactured by Amer- 
sham Pharmacia Biotech, or the like). 

[0177] A fluctuation in the expression amount of a specific gene can be monitored using a nucleic acid molecule 
obtained in the time course of culture as the nucleic acid molecule derived from coryneform bacteria. The culture 
conditions can be optimized by analyzing the fluctuation. 

[0178] The expression profile of the microorganism at the total gene level (namely, which genes among a great 
number of genes encoded by the genome have been expressed and the expression ratio thereof) can be determined 
using a nucleic acid molecule having the sequences of many genes determined from the full genome sequence of the 
microorganism. Thus, the expression amount of the genes determined by the full genome sequence can be analyzed 
and, in its turn, the biological conditions of the microorganism can be recognized as the expression pattern at the full 
gene level. 

(b) Confirmation of the presence of gene homologous to examined gene in coryneform bacteria 

[0179] Whether or not a gene homologous to the examined gene, which is present in an organism other than co- 
ryneform bacteria, is present in coryneform bacteria can be detected using the polynucleotide array prepared in the 
above (1). 

[0180] This detection can be carried out by a method in which an examined gene which is present in an organism 
other than coryneform bacteria is used instead of the nucleic acid molecule derived from coryneform bacteria used in 
the above identification/analysis method of (1). 

8. Recording medium storing full genome nucleotide sequence and ORF data and being readable by a computer and 
methods for using the same 

[0181] The term "recording medium or storage device which is readable by a computer" means a recording medium 
or storage medium which can be directly readout and accessed with a computer. Examples include magnetic recording 
media, such as a floppy disk, a hard disk, a magnetic tape, and the like; optical recording media, such as CD-ROM, 
CD-R, CD-RW, DVD-ROM, DVD-RAM, DVD-RW, and the like; electric recording media, such as RAM, ROM, and the 
like; and hybrids in these categories (for example, magnetic/optical recording media, such as MO and the like). 
[0182] Instruments for recording or inputting in or on the recording medium or instruments or devices for reading out 
the information in the recording medium can be appropriately selected, depending on the type of the recording medium 
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and the access device utili7ed. Also, various data processing programs, software, comparator and formats are used 
for recording and utilizing the polynucleotide sequence information or the like, of the present invention in the recording 
medium. The information can be expressed in the form of a binary file, a text file or an ASCII file formatted with com- 
mercially available software, for example. Moreover software for accessing the sequence information is available and 
known to one of ordinary skill in the art. 

[0183] Examples of the information to be recorded in the above-described medium include the full genome nucleotide 
sequence information of coryneform bacteria as obtained in the above item 2, the nucleotide sequence information of 
ORF, the amino acid sequence information encoded by the ORF, and the functional information of polynucleotides 
coding for the amino acid sequences. 

[0184] The recording medium or storage device which is readable by a computer according to the present invention 
refers to a medium in which the information of the present invention has been recorded. Examples include recording 
media or storage devices which are readable by a computer storing the nucleotide sequence information represented 
by SEQ ID NOS:1 to 3501, the amino acid sequence information represented by SEQ ID NOS:3502 to 7001, the 
functional information of the nucleotide sequences represented by SEQ ID NOS:1 to 3501 , the functional information 
of the amino acid sequences represented by SEQ ID NOS:3502 to 7001 , and the information listed in Table 1 below 
and the like. 

9. System based on a computer using the recording medium of the present invention which is readable by a computer 

[0185] The term "system based on a computer 1 ' as used herein refers a system composed of hardware device(s), 
software device(s), and data recording device(s) which are used for analyzing the data recorded in the recording me- 
dium of the present invention which is readable by a computer. 

[01 86] The hardware device(s) are, for example, composed of an input unit, a data recording unit, a central processing 
unit and an output unit collectively or individually. 

[0187] By the software device(s), the data recorded in the recording medium of the present invention are searched 
or analyzed using the recorded data and the hardware device(s) as described herein. Specifically, the software device 
(s) contain at least one program which acts on or with the system in order to screen, analyze or compare biologically 
meaningful structures or information from the nucleotide sequences, amino acid sequences and the like recorded in 
the recording medium according to the present invention. 

[0188] Examples of the software device(s) for identifying ORF and EMF domains include GeneMark (Nuc. Acids. 
Res., 22. 4756-67 (1994)), GeneHacker (Protein, Nucleic Acid and Enzyme, 42. 3001-07 (1997)), Glimmer (The Insti- 
tute of Genomic Research; Nuc. Acids. Res., 26: 544-548 (1 998)) and the like. In the process of using such a software 
device, the default (initial setting) parameters are usually used, although the parameters can be changed, if necessary, 
in a manner known to one of ordinary skill in the art. 

[0189] Examples of the software device(s) for identifying a genome domain or a polypeptide domain analogous to 
the target sequence or the target structural motif (homology searching) include FASTA, BLAST, Smith-Wajerman, 
GenetyxMac (manufactured by Software Development), GCG Package (manufactured by Genetic Computer Group), 
GenCore (manufactured by Compugen), and the like. In the process of using such a software device, the default (initial 
setting) parameters are usually used, although the parameters can be changed, if necessary, in a manner known to 
one of ordinary skill in the art. 

[0190] Such a recording medium storing the full genome sequence data is useful in preparing a polynucleotide array 
by which the expression amount of a gene encoded by the genome DNA of coryneform bacteria and the expression 
profile at the total gene level of the microorganism, namely, which genes among many genes encoded by the genome 
have been expressed and the expression ratio thereof, can be determined. 

[0191] The data recording device(s) provided by the present invention are, for example, memory devtee(s) for re- 
cording the data recorded in the recording medium of the present invention and target sequence or target structural 
molif data, or Lhe like, and a memory accessing device(s) Tor accessing Ihe same. 

[0192] Namely, the system based on a computer according to the present invention comprises the following: 

(i) a user input device that inputs the information stored in the recording medium of the present invention, and 
target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the information stored in the recording medium of the present invention with the 
target sequence or target structure motif information, recorded by the data storing device of (ii) for screening and 
analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target 
structure motif information; and 

(iv) an output device that shows a screening or analy7ing result obtained by the comparator. 
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[01 93] This system is usable in the methods in items 2 to 5 as described above for searching and analyzing the ORF 
and EMF domains, target sequence, target structural motif, etc. of a coryneform bacterium, searching homologs, 
searching and analyzing isozymes, determining the biosynthesis pathway and the signal transmission pathway, and 
identifying spots which have been found in the proteome analysis. The term "homologs" as used herein includes both 
of orthologs and paralogs. 

10. Production of polypeptide using ORF derived from coryneform bacteria 

[01 94] The polypeptide of the present invention can be produced using a polynucleotide comprising the ORF obtained 
in the above item 2. Specifically, the polypeptide of the present invention can be produced by expressing the polynu- 
cleotide of the present invention or a fragment thereof in a host cell, using the method described in Molecular Cloning, 
2nd ed., Current Protocols in Molecular Biology, and the like, for example, according to the following method. 
[01 95] A DNA fragment having a suitable length containing a part encoding the polypeptide is prepared from the full 
length ORF sequence, if necessary. 

[0196] Also, DNA in which nucleotides in a nucleotide sequence at a part encoding the polypeptide of the present 
invention are replaced to give a codon suitable for expression of the host cell, if necessary. The DNA is useful for 
efficiently producing the polypeptide of the present invention. 

[0197] A recombinant vector is prepared by inserting the DNA fragment into the downstream of a promoter in a 
suitable expression vector. 

[01 98] The recombinant vector is introduced to a host cell suitable for the expression vector. 

[0199] Any of bacteria, yeasts, animal cells, insect cells, plant cells, and the like can be used as the host cell so long 
as it can be expressed in the gene of interest. 

[0200] Examples of the expression vector include those which can replicate autonomously in the above-described 
host coll or can bo integrated into chromosome and have a promoter at such a position that the DNA encoding the 
polypeptide of the present invention can be transcribed. 

[0201] When a procaryote cell, such as a bacterium or the like, is used as the host cell, it is preferred that the 
recombinant vector containing the DNA encoding the polypeptide of the present invention can replicate autonomously 
in the bacterium and is a recombinant vector constituted by, at least a promoter, a ribosome binding sequence, the 
DNA of the present invention and a transcription termination sequence. A promoter controlling gene can also be con- 
tained therewith in operable combination. 

[0202] Examples of the expression vectors include a vector plasmid which is replicable in Corynebacterium glutami- 
cum. such as pCGI (Japanese Published Unexamined Patent Application No. 134500/82). pCG2 (Japanese Published 
Unexamined Patent Application No. 35197/83), pCG4 (Japanese Published Unexamined Patent Application No. 
183799/82), pCG11 (Japanese Published Unexamined Patent Application No. 134500/82), pCG116, pCE54 and 
pCB101 (Japanese Published Unexamined Patent Application No. 105999/83), pCE51 , pCE52 and pCE53 (Mol. Gen. 
Genet, 196: 1 75-1 78 (1 984)), and the like; a vector plasmid which is replicable in Escherichia coti, such as pET3 and 
pET11 (manufactured by Stratagene), pBAD, pThioHis and pTrcHis (manufactured by Invitrogen), pKK223-3 and 
pGEX2T (manufactured by Amersham Pharmacia Biotech), and the like; and pBTrp2, pBTad and pBTac2 (manufac- 
tured by Boehringer Mannheim Co.), pSE280 (manufactured by Invitrogen), pGEMEX-1 (manufactured by Promega), 
pQE-8 (manufactured by QIAGEN), pKYP10 (Japanese Published Unexamined Patent Application No. 110600/83), 
, pKYP200 (Agric. Biol. Chem., 48: 669 (1984)), pLSA1 (Agric. Biol. Chem., 53: 277 (1989)), pGEL1 (Proc. Natl. Acad. 
Sd. USA, 82: 4306 (1 985)), pBiuescript 1 1 SK(-) (manufactured by Stratagene), pTrs30 (prepared from Escherichia colt 
JM109/pTrS30 (FERM BP-5407)), pTrs32 (prepared from Escherichia coli JM109/pTrS32 (FERM BP-5408)), pGHA2 
(prepared from Escherichia coli IGHA2 (FERM B-400), Japanese Published Unexamined Patent Application No. 
221 091 /85), pGKA2 (prepared from Escherichia CO//IGKA2 (FERM BP-6798), Japanese Published Unexamined Patent 
Application No. 221091/85), pTerm2 (U.S. Patents 4,686,191, 4,939,094 and 5,160,735), pSupex, pUB110, pTP5 f 
pC1 94 and pEG400 (J. Bacterid., 172: 2392 (1 990)), pGEX (manufactured by Pharmacia), pET system (manufactured 
by Novagen), and the like. 

[0203] Any promoter can be used so long as it can function in the host cell. Examples include promoters derived 
from Escherichia coli, phage and the like, such as trp promoter (P trp ) , lac promoter, P L promoter, P R promoter, T7 
promoter and the like. Also, artificially designed and modified promoters, such as a promoter in which two Pfrp are 
linked in scries (P +rp x2) , fac promoter, /acT7 promoter tofl promoter and the like, can be used. 
[0204] It is preferred to use a plasmid in which the space between Shine-Dalgamo sequence which is the ribosome 
binding sequence and the initiation codon is adjusted to an appropriate distance (for example, 6 to 18 nucleotides). 
[0205] The transcription termination sequence is not always necessary for the expression of the DNA of the present 
invention. However, it is preferred to arrange the transcription terminating sequence at just downstream of the structural 
gene. 

[0206] One of ordinary skill in the art will appreciate that the codons of the above-described elements may be opti- 
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mized, in a known manner, depending on the host cells and environmental conditions utilized. 
[0207] Examples of the host cell include microorganisms belonging to the genus Escherichia, the genus Serratia, 
the genus Bacillus, the genus Brevibacterium, the genus Corynebacterium, the genus Microbacterium, the genus Pseu- 
domonas, and the like. Specific examples include Escherichia coli XL1-Blue, Escherichia co//XL2-Blue. Escherichia 
coli DH1, Escherichia co//MC1000 : Escherichia coli KY3276, Escherichia coli W1485, Escherichia coli JM109, Es- 
cherichia coli HB101, Escherichia coli No. 49, Escherichia coli Escherichia coli NY49, Escherichia coli GI698, 

Escherichia coli TB1, Serratia ficaria, Serratia fonticoia, Serratia liquefaciens, Serratia marcescens t Bacillus subtiits] 
Bacillus amyloliquefaciens, Corynebacterium ammonia genes, Brevibacterium immariophilum ATCC 14068, Brevibac- 
terium saccharoiyticum ATCC 14066, Corynebacterium glutamicum ATCC 13032, Corynebacterium glutamicum ATCC 
1 3869, Corynebacterium glutamicum ATCC 1 4067 (prior genus and species: Brevibacterium flavum), Corynebacterium 
glutamicum ATCC 13869 (prior genus and species: Brevibacterium lactofermentum, or Corynebacterium lactofermen- 
tum), Corynebacterium acetoacidophilum ATCC 1 3B70, Corynebacterium thermoaminogenes FERM 9244. Microbac- 
terium ammoniaphilum ATCC 1 5354, Pseudomonas putida, Pseudomonas sp. D-01 1 0, and the like. 
[0208] When Corynebacterium glutamicum or an analogous microorganism is used as a host, an EMF necessary 
for expressing the polypeptide is not always contained in the vector so long as the polynucleotide of the present in- 
vention contains an EMF. When the EMF is not contained in the polynucleotide, it is necessary to prepare the EMF 
separately and ligate it so as to be in operable combination. Also, when a higher expression amount or specific ex- 
pression regulation is necessary, it is necessary to ligate the EMF corresponding thereto so as to put the EMF in 
operable combination with the polynucleotide. Examples or using an externally ligaled EMF are disclosed in Microbh 
20 ology, 142. 1297-1309 (1996). 

[0209] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 
the above-described host cells, such as a method in which a calcium ion is used {Proc. NatL Acad, ScL USA, 69 2110 
(1972)), a protoplast method (Japanese Published Unexamined Patent Application No. 2483942/88), the methods 
described in Gene, 17:107 (1982) and Molecular & General Genetics, 168: 111 (1979) and the like, can bo used. 
[0210] When yeast is used as the host cell, examples of the expression vector include pYES2 (manufactured by 
Invitrogen), YEp13 (ATCC 37115), YEp24 (ATCC 37051), YCp50 (ATCC 37419), pHS19, pHS15 f and the like. 
[0211] Any promoter can be used so long as it can be expressed in yeast. Examples include a promoter of a gene 
in the glycolytic pathway, such as hexose kinase and the like, PH05 promoter, PGK promoter, GAP promoter, ADH 
promoter, gal 1 promoter, gal 10 promoter, a heatshock protein promoter, MFal promoter, CUP 1 promoter, and the like. 
[0212] Examples of the host cell include microorganisms belonging to the genus Saceharomyces, the genus 
Schizosaccharomyces, the genus Kluyveromyces, the genus Trichosporon, the genus Schwanniomyces, the genus 
Ptchia, the genus Candida and the like. Specific examples include Saceharomyces cerevisiae, Schizosaccharomyces 
pombe, Kluyveromyces lactis, Trichosporon pulluians, Schwanniomyces alluvius, Candida utilis and the like. 
[021 3] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into 
yeast, such as an electroporation method (Methods. EnzymoL, 194: 182 (1990)), a spheroplast method (Proc. NatL 
Acad. ScL USA, 75: 1929 (1978)), a lithium acetate method (J. Bacterid., 153: 163 (1983)), a method described in 
Proc. NatL Acad. Set. USA, 75: 1929 (1978) and the like, can be used. 

[0214] When animal cells are used as the host cells, examples of the expression vector include pcDNA3.1 , pSinRepS 
and pCEP4 (manufactured by Invitorogen), pRev-Tre (manufactured by Clontech), pAxCAwt (manufactured by Takara 
Shuzo), pcDNAI and pcDM8 (manufactured by Funakoshi), pAGE107 (Japanese Published Unexamined Patent Ap- 
plication No. 22979/91 ; Cytotechnology, 3:133 (1990)), pAS3-3 (Japanese Published Unexamined Patent Application 
No. 227075/90), pcDM8 (Nature, 329 840 (1987)), pcDNAI/Amp (manufactured by Invitrogen), pREP4 (manufactured 
by Invitrogen), pAGE103 (J. Biochem., 101: 1307 (1987)), pAGE210, and the like. 

[0215] Any promoter can be used so long as it can function in animal cells. Examples include a promoter of IE 
(immediate early) gene of cytomegalovirus (CMV), an early promoter of SV40, a promoter of retrovirus, a metal- 
lothionein promoter, a heat shock promoter, SRa promoter, and the like. Also, the enhancer of the IE gene of human 
CMV can be used together with Ihe promoter. 

[0216] Examples of the host cell include human Namalwa cell, monkey COS cell, Chinese hamster CHO cell, 
HST5637 (Japanese Published Unexamined Patent Application No. 299/88), and the like. 

[0217] The method for introduction of the recombinant vector into animal cells is not particularly limited, so long as 
it is the general method for introducing DNA into animal cells, such as an electroporation method (Cytotechnology, 3: 
133 (1990)), a calcium phosphate method (Japanese Published Unexamined Patent Application No. 227075/90), a 
lipofection method (Proc. Natl. Acad. ScL USA, 84, 7413 (1987)), the method described in Virology, 52: 456 (1973) 
and the like. " 

[0218] When insect cells are used as the host cells, the polypeptide can be expressed, for example, by the method 
described in Bacurovirus Expression Vectors, A Laboratory Manual, W.H. Freeman and Company, New York (1992) 
Biotechnology 6: 47 (1988), or the like. 

[0219] Specifically, a recombinant gene transfer vector and bacurovirus are simultaneously inserted into insect cells 
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to obtain a recombinant virus in an insect cell culture supernatant, and then the insect cells are infected with the resulting 
recombinant virus to express the polypeptide. 

[0220] Examples of the gene introducing vector used in the method include pBlueBac4.5 ( pVL1392, pVL1393 and 
pBlueBaclll (manufactured by Invitrogen), and the like. 

[0221] Examples of the bacurovirus include Autographa californica nuclear polyhidrosis virus with which insects of 
the family Barathra are infected, and the like. 

[0222] Examples of the insect cells include Spodoptera frugiperda oocytes Sf9 and Sf21 {Bacurovirus Expression 
Vectors, A Laboratory Manual, W.H. Freeman and Company, New York (1992)), Trichoplusia ni oocyte High 5 (manu- 
factured by Invitrogen) and the like. 

[0223] The method for simultaneously incorporating the above-described recombinant gene transfer vector and the 
above-described bacurovirus for the preparation of the recombinant virus include calcium phosphate method (Japanese 
Published Unexamined Patent Application No. 227075/90), lipofection method (Proc. Natl. Acad. ScL USA, 84: 7413 
(1987)) and the like. 

[0224] When plant ceils are used as the host cells, examples of expression vector include a Ti plasmid, a tobacco 
mosaic virus vector, and the like. 

[0225] Any promoter can be used so long as it can be expressed in plant cells. Examples include 35S promoter of 
cauliflower mosaic virus (CaMV), rice actin 1 promoter, and the like. 

[0226] Examples of the host cells include plant ceils and the like, such as tobacco, potato, tomato, carrot, soybean, 
rape, alfalfa, rice, wheal, barley, and the like. 

[0227] The method for introducing the recombinant vector is not particularly limited, so long as it is the general method 
for introducing DNA into plant cells, such as the Agrobacterium method (Japanese Published Unexamined Patent 
Application No. 140885/84, Japanese Published Unexamined Patent Application No. 70080/85, WO 94/00977), the 
electroporation method (Japanese Published Unexamined Patent Application No. 251887/85), the particle gun method 
(Japanese Patents 2606856 and 2517813), and the like. 

[0228] The transformant of the present invention includes a transformant containing the polypeptide of the present 
invention per se rather than as a recombinant vector, that is, a transformant containing the polypeptide of the present 
invention which is integrated into a chromosome of the host, in addition to the transformant containing the above 
recombinant vector. 

[0229] When expressed in yeasts, animal cells, insect cells or plant cells, a glycopolypeptide or glycosylated polypep- 
tide can be obtained. 

[0230] The polypeptide can be produced by culturing the thus obtained transformant of the present invention in a 
culture medium to produce and accumulate the polypeptide of the present invention or any polypeptide expressed 
under the control of an EMF of the present invention, and recovering the polypeptide from the culture. 
[0231] Culturing of the transformant of the present invention in a culture medium is carried out according to the 
conventional method as used in culturing of the host. 

[0232] When the transformant of the present invention is obtained using a prokaryote, such as Escherichia coli or 
the like, or a eukaryote, such as yeast or the like, as the host, the transformant is cultured. 

[0233] Any of a natural medium and a synthetic medium can be used, so long as it contains a carbon source, a 
nitrogen source, an inorganic salt and the like which can be assimilated by the transformant and can perform culturing 
of the transformant efficiently. 

[0234] Examples of the carbon source include those which can be assimilated by the transformant, such as carbo- 
hydrates (for example, glucose, fructose, sucrose, molasses containing them, starch, starch hydrolysate, and the like), 
organic acids (for example, acetic acid, propionic acid, and the like), and alcohols (for example, ethanol, propanol, and 
the like). 

[0235] Examples of the nitrogen source include ammonia, various ammonium salts of inorganic acids or organic 
acids (for example, ammonium chloride, ammonium sulfate, ammonium acetate, ammonium phosphate, and the like), 
other nitrogen-containing compounds, peptone, meat extract, yeasl extract, corn sleep liquor, casein hydrolysate, soy- 
bean meal and soybean meal hydrolysate, various fermented cells and hydrolysates thereof, and the like. 
[0236] Examples of inorganic salt include potassium dihydrogen phosphate, dipotassium hydrogen phosphate, mag- 
nesium phosphate, magnesium sulfate, sodium chloride, ferrous sulfate, manganese sulfate, copper sulfate, calcium 
carbonate, and the like. 

[0237] The culturing is carried out under aerobic conditions by shaking culture, submerged-aeration stirring culture 
or the like. The culturing temperature is preferably from 15 to 40°C, and the culturing time is generally from 16 hours 
to 7 days. The pH of the medium is preferably maintained at 3.0 to 9.0 during the culturing. The pH can be adjusted 
using an inorganic or organic acid, an alkali solution, urea, calcium carbonate, ammonia, or the like. 
[0238] Also, antibiotics, such as ampicillin, tetracycline, and the like, can be added to the medium during the culturing, 
if necessary. 

[0239] When a microorganism transformed with a recombinant vector containing an inducible promoter is cultured, 
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an inducer can be added to the medium, if necessary. 

[0240] For example, isopropyl-p-D-thiogalactopyranoside (IPTG) or the like can be added to the medium when a 
rn.croorgan.sm transformed with a recombinant vector containing lac promoter is cultured, or indoleacrylic acid (IAA) 
s iscuftured 08 " * microorganism transformed with an expression vector containing trp promoter 

Sf/ill J? 3 " 1 ? 68 °! thC mediUm USed in culturin 9 a transformant obtained using animal cells as the host cells include 

2ra ^o^% J ?^ na '° fthe AmeriCan Medical Association, 199. 519 (1967)). Eagle's MEM medium (Sci- 

s^lfrfn «• f . 6CC0S m ° dified MEM medium W' 0 "** 8 ' 396 < 1959 »- 199 Medium {Proceeding of the 

to *™^< heB "' c VicalMed l ane, 73:1 (1950)), the above-described mediate which fetal calf serum has been added, 

CO* tor I^VdaJs" 9 15 ° arried ° Ut 9enerally 3t 3 pH 0f 6 10 8 and a tem Perature of 30 to 40"C in the presence of 5% 

SSSturiHT' " ne ° eSSary ' antibiotics ' such as kanamycin, penicillin, and the like, can be added to the medium during 

™^*L Examples of the medium used in culturing a transformant obtained using insect cells as the host cells include 
Pvr^TlT! ( , manufactured b V Pharmingen), Sf-900 II SFM (manufactured by Life Technologies). ExCell 400 and 
™£« 40 f (manufactured by JRH Biosciences), Grace's Insect Medium (Nature, 195. 788 (1962)), and the like 
InoZ ! culturin 9 is carried out generally at a pH of 6 to 7 and a temperature of 25 to 30»C for 1 to 5 days 
[0246] Additionally, antibiotics, such as genlamicin and the like, can be added to the medium during the culturing. if 

f^ 47 ? » A t ™ sformant stained by using a plant cell as the host cell can be used as the cell or after differentiating 
UQi rf °»?^ an - 1**™^* of the medium us ed in the culturing of the transformant include Murashige and Skoog 
and tholfkT' 16 meCHUm ' mediS t0 Whi ° h 3 P ' ant hormone ' such as auxin - cytokinine, or the like has been added, 

CUltUrin9 is carried out generally « a P H of 5 to 9 and a temperature of 20 to 40»C for 3 to 60 days 
w£! * antib.ot.es, such as kanamycin, hygromycin and the like, can be added to the medium during the cul- 
tunng, if necessary. 3 

[0250] As described above, the polypeptide can be produced by culturing a transformant derived from a microor- 
ganism animal cell or plant cell containing a recombinant vector to which a DNA encoding the polypeptide of the 

!T a "r b6en inSerted accordina t0 th e general culturing method to produceand accumulate the polypep- 

tide, and recovering the polypeptide from the culture. 

S^IL™* wT*^ 0 - 9Bne expression ma V include secretion of the encoded protein production or fusion protein 

fv^rfff ° 6 accordance w ith the methods described in Molecular Cloning. 2nd ed.. in addition to direct 

expression. 

J* 1 ? m f, fll0d f ° r producin 9 the Polypeptide of the present invention includes a method of intracellular expres- 
Educed ° an be Sele ° ted by chan 9 in 9 the host ce " employed or the structure of the polypeptide 

ESn- V SJ en *?' polypeptide ot the present invention 's produced in a host cell or on a host cell membrane outer 
!?!.?7l « ? po ' ypept,de 030 be Positively secreted extracellularly according to, for example, the method of Paulson 
etaL (J. Biol. Chem., 264: 17619 (1989)), the method of Lowe etal. (Proc. Natl. Acad. Sci. USA 86. 8227 (1989)- 

N T^SSS ™ l^SS^ and/ ° rthe meth0dS deSCribed in Japanese Pub,ished Unexamined Patent Application 
No. 336963/93, WO 94/23021 , and the like. 

inn f« S ™ e S' i ! a " y ' the po| y peptide of tne present invention can be positively secreted extracellularly by expressing 
o^nl^T I? 8 S ' 9na PeptWe haS bee " added t0 the fore 9 ro und of a polypeptide containing an active site of the 
polypeptide of the present invention according to the recombinant DNA technique 

I Tfl ^ U | rlhermore ' ^ount produced can be increased using a gene amplification system, such as by use or 

P«t!L, T , red " Ctase 9 ene or the like according to the method described in Japanese Published Unexamined ' 
Patent Application No. 227075/90. 

!E?L ^ 0re ° Ver ' the po| yP e Pti d e of the present invention can be produced by a transgenic animal individual (trans- 
genic nonhuman animal) or plant individual (transgenic plant). 

SfS Wh0 " ?! transforrnant is the ar, imal individual or plant individual, tho polypeptide of the present invention 
^rJ,l«H r £ V " 9 ° r cultivatin 9 il 60 as to Produce and accumulate the polypeptide, and recovering the 
polypeptide from the animal individual or plant individual. 

S3, EXa !H P l!! ° f the meth0d f ° r producin 9 the polypeptide of the present invention using the animal individual 
LcrnrHin T 1 J producin 9 ,he Polypeptide of the present invention in an animal developed by inserting a gene 
according to methods known to those of ordinary skill in the art (American Journal of Clinical Nutrition, 63. 639S (1996) 
Amencan Journal of Clinical Nutrition, 63 627S (1 996), Biotechnology, 9 830 (1 991 )) 
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[0259] In the animal individual, the polypeptide can be produced by breeding a transgenic nonhuman animal to which 
the DNA encoding the polypeptide of the present invention has been inserted to produce and accumulate the polypep- 
tide in the animal, and recovering the polypeptide from the animal. Examples of the production and accumulation place 
in the animal include milk {Japanese Published Unexamined Patent Application No. 309192/88). egg and the like of 
5 the animal. Any promoter can be used, so long as it can be expressed in the animal. Suitable examples include an a- 
casein promoter a (P-casein promoter, a p-lactoglobulin promoter, a whey acidic protein promoter, and the like, which 
are specific for mammary glandular cells. 

[0260] Examples of the method for producing the polypeptide of the present invention using the plant individual 
include a method for producing the polypeptide of the present invention by cultivating a transgenic plant to which the 
io DNA encoding the protein of the present invention by a known method (Tissue Culture, 20 (1 994), Tissue Culture, 21 
(1 994), Trends in Biotechnology, 15: AS (1 997)) to produce and accumulate the polypeptide in the plant, and recovering 
the polypeptide from the plant. 

[0261] The polypeptide according to the present invention can also be obtained by translation in vitro. 

[0262] The polypeptide of the present invention can be produced by a translation system in vitro. There are, for 

*s example, two in vitro translation methods which may be used, namety, a method using RNA as a template and another 
method using DNA as a template. The template RNA includes the whole RNA, mRNA. an in vitro transcription product, 
and the like. The template DNA includes a plasmid containing a transcriptional promoter and a target gene integrated 
therein and downstream of the initiation site, a PCR/RT-PCR product and the like. To select the most suitable system 
for the in vitro translation, the origin of the gene encoding the protein to be synthesized (prokaryolic cell/eucaryolic 

20 cell), the type of the template (DNA/RNA), the purpose of using the synthesized protein and the like should be consid- 
ered. In vitro translation kits having various characteristics are commercially available from many companies (Boe- 
hringer Mannheim, Promega, Stratagene, or the like), and every kit can be used in producing the polypeptide according 
to the present invention. 

[0263] Transcription/translation of a DNA nucleotide sequence cloned into a plasmid containing a T7 promoter can 
25 be carried out using an in vitro transcription/translation system E. co//T7 S30 Extract System for Circular DNA (man- 
ufactured by Promega, catalogue No. L1130). Also, transcription/translation using, as a template, a linear prokaryotic 
DNA of a supercoil non-sensitive promoter, such as /acUVS, tac, XPL(con), XPL, or the like, can be carried out using 
an in vitro transcription/translation system E. co7/S30 Extract System for Linear Templates (manufactured by Promega, 
catalogue No. L1 030). Examples of the linear prokaryotic DNA used as a template include a DNA fragment, a PCR- 
50 amplified DNA product, a duplicated oligonucleotide ligation, an in vitro transcriptional RNA, a prokaryotic RNA, and 
the like. 

[0264] In addition to the production of the polypeptide according to the present invention, synthesis of a radioactive 
labeled protein, confirmation of the expression capability of a cloned gene, analysis of the function of transcriptional 
reaction or translation reaction, and the like can be carried out using this system. 

35 [0265] The polypeptide produced by the transformant of the present invention can be isolated and purified using the 
general method for isolating and purifying an enzyme. For example, when the polypeptide of the present invention is 
expressed as a soluble product in the host cells, the cells are collected by centrifugation after cultivation, suspended 
in an aqueous buffer, and disrupted using an ultrasonicator, a French press, a Manton Gaulin homogenlzer, a Dynomill, 
or the like to obtain a cell-free extract. From the supernatant obtained by centrifuging the cell-free extract, a purified 

40 product can be obtained by the general method used for isolating and purifying an enzyme, for example, solvent ex- 
traction, salting out using ammonium sulfate or the like, desalting, precipitation using an organic solvent, anion ex- 
change chromatography using a resin, such as diethylaminoethyl (DEAE)-Sepharose, DIAION HPA-75 (manufactured 
by Mitsubishi Chemical) or the like, cation exchange chromatography using a resin, such as S-Sepharose FF (manu- 
factured by Pharmacia) or the like, hydrophobic chromatography using a resin, such as butyl sepharose, phenyl sepha- 

45 rose or the like, gel filtration using a molecular sieve, affinity chromatography, chromatofocusing, or electrophoresis, 
such as isoelectronic focusing or the like, alone or in combination thereof. 

[0266] When Ihe polypeptide is expressed as an insoluble producl in the host cells, the cells are collected in Lhe 
same manner, disrupted and centrifuged to recover the insoluble product of the polypeptide as the precipitate fraction. 
Next, the insoluble product of the polypeptide is solubiiized with a protein denaturing agent. The solubilized solution 
so is diluted or dialyzed to lower the concentration of the protein denaturing agent in the solution. Thus, the normal con- 
figuration of the polypeptide is reconstituted. After the procedure, a purified product of the polypeptide can be obtained 
by a purification/isolation method similar to the above. 

[0267] When the polypeptide of the present invention or its derivative (for example, a polypeptide formed by adding 
a sugar chain thereto) is secreted out of cells, the polypeptide or its derivative can be collected in the culture supernatant. 
55 Namely, the culture supernatant is obtained by treating the culture medium in a treatment similar to the above (for 
example, centrifugation). Then, a purified product can be obtained from the culture medium using a purification/isolation 
method similar to the above. 

[0266] The polypeptide obtained by the above method is within the scope of the polypeptide of the present invention, 
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mTESfZSZ SSTS^Sr 1 ^ "■ P °* nUcle ° tide com P risi "9 *. nucleotide sequence seated from 
NOS.3502 to 6931 Po'VPept.de compns.ng an amino acid sequence represented by any one of SEO ID 

^.'SIIZ^^ 80 amin °, a f SeqUe " Ce Wh * h at least — amino acids is deieted, 

as that o t£££55ft lSS^rlS , T5r 6 PO,yPeptide and havin -" ^bstantialiy the same activity 

Ge/jfi 34 ^^MQon »/ , i"' ™ u; >- ™«>-. /wo^o^ (lytJ^;, Proc. NatL Acad. Sa. USA 79 6409 (-\982\ 

be caused in the Lme am no^c d in * ^quence. The deletion, replacement, insertion or addition may 

can be na^or non natu " ^S^TT^' ^ ^ ^ "*' rep ' aCed - inSerted or added 



Group A: 



Group B: 
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[0273] asparatic acid, oiutamic acid, isoasparatic acid, isoglutamic acid, 2-aminoadipfc acid. 2-aminosuberic acid; 
55 Group C: 

[0274] asparagine, glutamine; 
Group D: 

10275] lysine, arginine, ornithine, 2,4-diaminobutanoic acid, 2,3-diaminopropionic acid; 
Group E: 

[0276J proline, 3-hydroxyproline, 4-hydroxyproline; 
Group F: 

[0277] serine, threonine, homoserine- 

50 ' 

Group G: 

[0278] phenylalanine, tyrosine. 

80«/„ or mo^and partS o eie^w *" T »** ° han ^ of 60% " ™re Pliably 

lated. for example ^d^XS?? or more, wth the polypeptide which has not been mutated, when calcu- 
orthelike ^ amp ' e,USing default < '"^'^'"SJP^metersbyahomology searching software, such as BLAST FASTA 
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Lm^ m ! P° lypept,de of the P resent invention can be produced by a chemical synthesis method, such as 
Fmoc (fluorenylmethyloxycarbonyl) method. tBoc (t-butyloxycarbonyl) method, or the like. It can also be synthesized 
S,rL„? P S d * * yn * e f zer manufactured by Advanced ChemTech, Perkin-Elmer, Pharmacia. Protein Technology 
Instrument. Synthecell-Vega. PerSeptive. Shimadzu Corporation, or the like 

oHheVeIent t inventbn ant0f Pr6Sent inV6nti ° n Can be used for objects ot herthan the production of the polypeptide 

Sr a ? P H eCifi ^" y ' ? ' eaSt 006 component selected from a " amino acid, a nucleic acid, a vitamin, a saccharide, an 
Snmhi 8 ! "S °. 9U8S there ° f ° an be P roduced b V c """ring the transformant containing the polynucleotide or 
fr^mTm nn ^ ■ PreSent inV6nti ° n ,n 8 medium t0 P roduce and accumulate at least one component selected 
from the medium " SaCCharides " organic acids - and analogues thereof, and recovering the same 

X2L bi ° synth{ * is P at "ways. decomposition pathways and regulatory mechanisms of physiologically active 
Zfni T S " 88 am !- n0 ^ " UCleiC aCidS ' Vitamins ' saccharides, organic acids and analogues thereof differfrom 
orgamsm to organism. The productivity of such a physiologically active substance can be improved using these differ- 
of T,inf ^ !fh V Y ,ntr ° ducing a ^terogeneous gene relating to the biosynthesis thereof. For example, the content 
£222 / J °" e ,he eSSSntial amin ° acids ' in a P' ant seed was im P">ved by introducing a synthase gene 
svnthasl oTnfrr^ (W ° c 9 r 91 90> - AIS0 ' ar9inine iS e *cessive.y P™duced m a culture by Introducing an argfnine 

-r Escherichia coll (Japanese Examined Patent Publication 23750/93) 

be cuL Jh h ^ SUCh d P^^'^^a'^ aclive substance, the Iransformanl according to the present invention can 
.nvention 1 d^rH 6 mrthod " em P ,oved in culturing the transformant for producing the polypeptide of the present 
rTomb.w 1 f Ve ' AIS ° ! the P h y 5io,0 9 ica "y active substance can be recovered from the culture medium 
MasT pI T • f r eX ! m J P ' e ' thG i0n exchan 9 e resin method, the precipitation method and other known methods. 
tte^o^S^'E ih ° dS * n ™ n 10 one of ordinar y sk«' the art include electroporation. calcium transfection. 
£JZ Z TT , \ US,n9 8 Phag °' and th ° ,ikC ' Whcn tho h °st is a bacterium; and microinjection. 

Hte 1! S ! transf « !t,on - tne P° sitive| y Charged lipid-mediated method and the method using a virus, and the 
JE^rT T ♦ V eukaryote (Modular Cloning, 2nd ed.; Specter et at, Ce/fe/a fe^ratary /namia/. Cold Spring 
mah^r l^tltTf ' 19 , 98)) - EXamP ' eS ° f the hOSt indUde P rokaf y>tes. lower eukaryotes (for example, yeasts) 
frlmlnt ! ( !u t Xamp,e ' mammals >> and ce,,s Elated therefrom. As the state of a recombinant polynucleotide 

or^n^L, t XamP ' ! P aSmid) haVinS an inde Pendent replication unit outside the chromosome. These trans- 
J? " ? m producin9 the Polypeptides of the present invention encoded by the ORF of the genome of 
caX >Z HZ™ 9 ' Utam,cu "!: the Polynucleotides of the present invention and fragments thereof. Alternatively, they 
can be used in producing arbitrary polypeptides under the regulation by an EMF of the present invention. 

1 1 . Preparation of antibody recognizing the polypeptide of the present invention 

ES fl A «h«w tib ° d ?L W ru h reco9nizes the Polypeptide of the present invention, such as a polyclonal antibody, a mon- 

tavXZTZZ??? ' T ^ Pr ° dUCed USi " 9, 88 a " antigen " 3 purified product 01 the Polypeptide of the present 
ort^Z J t l 9mem po,ype P t,de of the Polypeptide or a peptide having a partial amino acid sequence of the 
polypeptide of the present invention. 

(1) Production of polyclonal antibody 

SSL** P0 '. yc,0na, antibod y ran be produced using, as an antigen, a purified product of the polypeptide of the 
o^lTJ^SS: «f P ,t ' ra9m ? nt 01 the Polypeptide, or a peptide having a partial amino acid sequence 

m^, XP P pr6Sent lnv en«on, and immunizing an animal with the same. 

mS f Xamp ' eS °/ l „ he animal lo be immunised include rabbils, goals, rats, mice, hamsters, chickens and Ihe like, 
luitraj A dosage of the antigen is preferably 50 to 1 00 ug per animal 

as 2 kfi ft .!? en tbe Pep,ide is used as the a ntigen, it is preferably a peptide covalently bonded to a carrier protein, such 

bl r„ltM! mf>e l emOCyanin ' b ° Vine ^y^ 1015 """- orth e like. The peptide used as the antigen can be synthesized 
oy a peptide synthesizer. 

JSTL I* 1 *; administration of tho antigen is. for example, carried out 3 to 10 times at the intervals of 1 or 2 weeks 
venouSf* ad ;™ ,stration - ° n the 3rd to 7th day after each administration, a blood sample is collected from the 
^2. r , eyeground, and it is confirmed that the serum reacts with the antigen by the enzyme immunoassay 

,9akU Sh0i " (1976 > ; Cold Spring 

♦? emm fe obtained from the immunised non-human mammal with a sufficient antibody titer against the antigen 
used for the immunization, and the serum is isolated and purified to obtain a polyclonal antibody. 
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(2) Production of monoclonal antibody 
(a) Preparation of antibody-producing cell 

tSSmUmS ZlZ^eTf^ZT antib0d , y titSr 3931,151 3 Partia ' ,r ** me " P***— «* the polypep- 
[02951 On the wlSrS^ ^ * US6d 38 8 SUpply source of an antibody-producing cell 
spleen fe Seised ^ ^ ^ ant ' 9en SUbStance is administered the rat showing the antfbod" titer, the 

[0297] The spleen in ^meS^ 

minutes to eliminate !^SSS!VSSll T T * Tris - amm ° niur " deride buffer (pH 7.65) for 1 to 2 
as antibody-producingTel™ t,meS W,th medlum - and *e resulting spleen cells are used 

(b) Preparation of myeloma cells 

S ^r^^^ r rat is used - Examp,es °< -« - — 

Immunol., 81: 1 (1978) • filL T F^tX Vl here ' nafterre,eTOdtoas " p 3-Un(C«yrr. r 0 p/ C s/^/l«cro- 
P3-X63-A g e 6 53 (653, M^S^ ( f;- 2) "* 269 < 1978 » : 
like, which are 8-azaguanine-resistant moutl ml. S , P3 - X , e3 - A 9 8 < X63 > 0611 (Wafore, 256. 495 (1975)), and the 

guanine medium (medium in ^teh toTZiurrttbS "T"^ ^ ^ Ce " " neS ™ subcu ""^ "" 8-aza- 
toethanol, 10 ^| oenliwn Si W l^S ^ ^^" 9 15 mm0l/, g,Utamine ' 5X1 °- 5 mol/l 2-mercap- 
inafter referred to as^oZl medium-f «T ( > (manufactured bv CSL) to RPMI-1640 medium <h.£ 

~or4daysbef^^^ 

(c) Production of hybridoma 

Sm JXJEffl^^ ? A"2J£,T r ce,, r btained in (b) are washed wi * 

g, distilled water: 1 liter dH 7 2?and Tv^h tl 9 \ SOd,um d '"ydrogen phosphate: 0.21 g, sodium chloride: 7.65 
1 . foltowed by cen triugation In 2M nZJl f ° f antibo ^^9 cells : myeloma cells = 5 : 1 to 1 o- 
[03001 The cells inSfr.^ ' ? ° r 5 mmUtes ' and the su Pematant is discarded. 

9 of Polyett^^^ th ri 0rO ^ ' 0OSened ' 0 2 10 1 m ' ° f a mixed * ^ 

antibody-producingceHs J ^^der sWrlTal'^rand m"' °I fT??*"* 1 " <DMSO) P6r 108 
added thereto several times at 1 to 2 mhuto huISL ' * 2 m ' ° f MEM medium is further 

fraction were gentlyioosenedTZhen S " pernatant ,s Warded. The cells in the resulting precipitated 

mo* hypoxanthine" 1 SxT* ^SSS andTx 10 ^ mo^ m ' " hATm ^ m <* e -edium to which 1 «n 

up into and discharging from a measuCplpette ammoptenn have been added) by repeated drawing 

X? «X2C* n " P0Ufed a 96 CU ' lUre P ' ale al 1 00 ^ and - 37-C for 7 to 14 days in 

noassay described in AnLdies AL^ZLnu^r ^T T™ " ****** aCCOrdin9 10 the 

S3 ; sp ~-r mp,oof ^ 

* obtained in (d) described teZTaZ?^* H? 7 1* hyb " d ° ma CU ' tUring su P er ™ or a purified antibody 
noglobulin antibody labeled w^an en™ 1 h' , ^ t0 With an anti - rat or antknouse immu- 

secondantbodufor^^r " 1. J . ":r a Cnem,Ca ' ,uminous substance, a radioactive substance or the like as a 

tide of the present invention fe^ !* 

P .nventoon ,s selected as a hybndoma capable of producing a monoclonal antibody of tne preleSt 
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invention. 

(d) Preparation of monoclonal antibody 

^weLkird 6 ^ 0 " 0010 "? antibod y-P roduc '"9 hybridoma cells obtained in (c) are injected intraperitoneal!/ into 8- to 
ipSSSlr^^.T 8 TV™ PriStane administration of 0.5 ml o^Zo Zetone*. 

£^1Z?XT£Zr Y """" ° f fe6din9) 81 5X106 ,0 20X106 Ce " S/animal - hybhdoma iutes 
£n° for 5 minutef " *" ° r nUde mk »' and * *™ -"tents at 3000 

ss » A ^!srjsK and iso,ated from the resu,,ing supernatant accordino to - 

otai JlSSS^VV^ ^ bS determined usi "9 a monodona. antibody typing kit or a rat mon- 

T^iT^Tz^T P ° lyPePt,de am ° Unt ^ be determ ' ned * »• L0Wf V method oTSy Lcuiation baled 

EJ a n!'i°H y 0bt t ed in the 3b0Ve iS Withi " sc °P e of the antibod y °< «ie present invention 

aSSa * using an antibody, such as'a radioactive materia, iabeied 
„h ^ x competitive binding assay, an immunotissue chemical staininq method (ABC method r^A moth 

Vol 2 M9831 A Vol ^ /iooc D ° ( Tochn "l uos "> Immunocytochomistry, Academic Press, Vol. 1 (1982) 

e g hoSSS oe o?dif/»Sr T^' a " ^ ,abel <•*• biotin " avidin ' or the « te >- enzyme .abe. 
"kela^^^^ ° r the ,ike >' a ""orescence iabe. (e.g.. FITC. rhodamine. or the 

s~ r ^ * 308 < 1979 >= 

of a Sp^ ° r absence in an ^nism other than coryneform bacteria 

She a^^^ bV an,ib ° dy ^ bS PUrifiSd by " a «^ chromatography 

12. Production and use of polypeptide array 
(1) Production of polypeptide array 

po^ryiamSrt^e 'iZZlZTIT?* ^ M " P 0 »<**>™** » the like; an acrylic resin, such as 

; Ucr: d ™ 

support -SiSS^^S?^ T 0 ^" 9 10 *° Pr0SCnt inV ° nti0n ° an bC adhcrcd to th0 surfa - - the soiid 
(19991 desenbed .n Biotechniques, 27: 1258-61 (1999); Molecular Medicine Today 5 326-7 

S T^ aZZ^'T^" 0 ^' editi0n ' B,aCkWe " SCiSntifiC Pub,fcati0n ^ Cha P te " £X£ 

present in^e^Tfh d «Sr d ° an ^ performed b * adherin 9 the P"ftW«ld- or antibody of the 

present invention to the solid support at a high density, though a high fixation density is not always necessar/ 
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(2) Use of polypeptide array 
[0322] 
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15 



9 a polypeptides of the present invention to the following steps (i) to (iv)- 

s ess ssr** ^ ha * s *° - - p— . ^ ^ by me meffiM 

compound, o, a second labal mu » b I ° ** " ""** ™" °' ' *"°°<< PMyPWoa o, 
unbound materia has bean ramo««j anT^ " 10 8 °™P0"« of lb. cornpl„ afte 

fiv) analvyino th« ^* / . ' 



—wwn ICIIH, 

(iv) analyzing the detection data. 
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sequence selected from SEQ ,D NO&3502 t oX^ ^noacid 
least one amino acids is deleted, replaced S^er'^SS^T^^^^ a " ami "° acid set ' uence «" which at 
substantia.^ the same activity as th'at of the ^ 

homology of 60% or more with the amino acid soquTncos of t^Zl T*'™ 9 a " amin ° acld sea - uence h *ving a 

(iv) analyzing the detection data. 

compnsmg an amino acid sequence i»^ ilTo?D^i^? 80 ant,t>0dy Which rec °9™«s a polypeptide 
acid sequence in which at least one am£S£E 5fJ? "? :35 ° 2 ,0 7001 • a Polypeptide comprising iTaSno 
of the polypeptide and having substan^ ^^TZtlt ZT^?^ " ^ in the «*» ^d'sequ^e 
ammo acd sequence having a homology X/o^eSZZ the H P ° lype P tide - a Polypeptide comprising an 
ami™ m T SSme aCMV,t y as that <* 'he polypeS ^ ^ ^T 6 " 068 * "» Peptide and having 

ammo acid sequence of a part of a polypeptide * fra9ment P 0 ^™*. or a peptide comprising an 

^-ime^^^^^^ 

optimized by analyzing the fluctuation " cor y nefo ™ b *oteria. The cultunng conditions can be 

ISLT" 8 P ~ — <™ * — of coryneform bacteria is used, a mutated polypeptide can be 
13. Identification of useful mutation in mutant by proteome analysis 
Se,e^^^ 

BBT^X; mass 8 ™«° SKSC^SSCT e — ** «— i-« - the 
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T03321 In sparrhino tl ^ S ° electnc _ P ' Van0us se P ara, '°n patterns can be achieved (JIS K 3600 2474) 

Sg .tSrai' b3CteriUm te mUtam ^ " '° SSib,e to «-* • Peptide 

S p^duSX\T a rIf 0 l a T T Strain ° f COrynef0mi baCteria and a P roducti0 " •* showing an im- 

LeZl ZwZit 1! ?22£?S T kes ,l possib,e ,0 efficiently identif y a mutation p*** «hich b " 

[0335 SpeSra SdET^ ? * T Pr ° dUCt " 8 pr ° tein ° f which «P«-on amount is fluctuated, 
.o J Pn 2S^wr'?£r ^t^ 11 ^,??^ 3nd 3 'y sine -P^9 ^in thereof are each subjected 
found and a TaL base fe searched hT f *Z t™**"*** strain, compared with the wild type strain, is 
in the lysine p^Z^^SS^i^T Sh ° Win9 *" inCr63Se h yie ' d ,n accorda "<* ^han Urease 
a fysinlproducingtS the S^^cS^^; al^T" 16 ^ ° n 8 «™ and 

3785 is increased in the lysine-produdng m!5ant 9 S6qUenCe represented * SEQ '° 

SLl^^ ' eVel fe ident,,ied * P"> te °™ -n-y* "Sing the nuc.eotide 

present inventio^d IT^^Z^ T T ,n,0m,ati0n ' of the 9 enome * the coryneform bacteria of the 

protein and th^^ nUC J e0lide se < uence °< «» 9«» encoding this 
sepu havingahighexpro™ 

storing tho nucleotide sequence VnZ^lnT P 66 efhcient, y iden ««ed using the recording medium 

SSL ( n M u°cS iirTT 1 can be easny spet * fed * — » a «— - ~ 

medium ^^^1^^ !"? re ' at ' n9 * ^ thUS identifi6d pr0tein USin 9 a recordin 9 

corynefomi bacteria of ^rTJnT f- ,nfofmat,on and "» am '"° «W sequence information, of the genome of 

» signed on'hetiis o K ^^E^r* 1 * "^T "* "* ^ de " 

industrially useful mutant laZZZT ° sequence - ^ a res "t that the useful mutation point is specified an 
OaaSTrhS^ mUtati0n ° r ° ther useful station derived therefrom can be easNy bred 

's not kited thereto '" ^ b6,0W based on ^wever. the present Mention 

35 Example 1 

Determination of the full nucleotide sequence of genome of Corynebacterium gtutamicum 

~ whotUm^ 

the terminal sequences were de ermined *n™5L cfi )>- , u ° d ' 8 9en ° me " brary was P re Pared and 
cover the full genome. sJSSSSS^^ °" 3 * 

^ (1) Preparation of genome DNA of Corynebacterium glutamicum ATCC 13032 

by centrifugation. Aftef wiSa S STE SrTET B 81 3 °° C ° Vemi9hl 8 " d lhe Ce " S were co,,ecled 

8 0) the cells wLT «LT h „ 9 buffer (10.3% sucrose, 25 mmoW Tris hydrochloride, 25 mmol/l EDTA dH 

a^Si^^JS'S'JJ sds wI^HH^r C ° ntaining 10 m9/ml lysozyme ' fo,,owed * ^ S2i - 

at 6S*C for 1 0 minuS «„h h f ^ 8dded theret ° t0 ,yse the cells ' and the resulta "t mixture w as maintained 

follotcdbygc^ 
aqueous layLwrsenarf^ 

in the same ^ST^^JC? tSU^ ^ phenol/ch,0 ^ omi and with ch.orofom, (Lee) 

1/10 times volume anH^. ? * ' S ° d ' Um acetate so,ution <P H 5 2 ) and isopropanol were added at 

^ b : x 9ent,y Stirrin9 t0 Prcciptete *• «^-~ DNA. ?he 
0-02 mg/m. of RN J ^ZSS ned Jt 37^C ^ 1 mmol/,EDTA -PH B.0) containing 

chloroform were carried ortTeSlS in th» " Th8 extractions with P"en°'- phenol/chloroform and 

ere carried out successively in the same manner as the above. The genome DNA was subjected to iso- 
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*o mi oi i fc buffer to g,ve a genome DNA solution (concentration: 0.1 mg/ml). 
(2) Construction of a shotgun library 

l ™LZtZ:ZZToZ ^t:^ DNA of atcc 

at an output of 20 continuously for 5 second! to Tl t * * S ° niCat0r (Yamat0 p °wersonrc Model 150) 

ended using a DNA b.unting * 1 ° 9en ° me fra9mentS 

electrophoresis. Genome fragments of 1 to ?to wJwnS ,f * . donated by 6% polyacrylamide gel 

ammonium acetate. lOmmollagLtm^ 5?SL" nd ° 3 m ' MG e,Uti0n buffer < 0 - 5 ™«" 
at37-Cov e rnighttoeluteDNA.TheDNA^SwasZtJd^ 

to obtain a genome library insert. The tota! nse^ a " d then I™**— with ethano. 

macia Biotech) were ligated at 1 6»C to 40 Zrs 9 P Smal/BAP (manufactured by Amersham Phar- 

electroporation under conditions accord na toth^ZLT ™ (manufactured by Life Technologies) by the 

medium (LB medium (to * b££SS£ JS^^^'STS ^ ^ ^ ip ~ - °" P |ate 
containing 0.1 mg/ml ampicillin, 0.1 mg/m X-aal and mmSJ ^ fJT Ch, ° nde ' pH 7 0) containin 9 1 -6% of agar) 
at 37'C overnight. ^ 9dl dnd 1 mmol/l ■ so P ro PyH3-D-thiogalaclopyranoside (IPTG) and cultured 

EiLEEXrS ^S^^o*^T- T medium was stationari * — d * * 

containing ^ glycerol was added 
(3) Construction of cosmid library 

£2l r^rsTuzTand ST?"**? ATCC 13032 Was P-"* with 

sucrose density gradient obtahS ^^J^t^^ (26 '°°° ^ 18 h0Urc " 20 ° C > unde " 0 *> <"% 
» mmo M EDTA, 10%or40%«aS S ISiE^ZJ^ ° NaC '' 2 ° ^ ^ 5 
tubes at 1 ml in each tube After confirming thTnMA * 9 ' th * S ° lut,0n thus se P*"«ed was fractionated into 

a fraction containing a larg^ ^^^t^iS^'S?; " "* by 96 ' ^Pnoresis 

t0346] The DNA fragment was B<^^eS3?2^7 J? prec, P teted with «*«*>l. 
with the manufacture's instructions Si £^S^^S^' (manufactured by Stratagene) in accordance 
(manufacturedbystratagene) using ^ int0 ******* «*XL-1-BlueMR strain 

with the manufacture's instruSVrTS^ 

icillin and cuftured therein at ^o^r^ZslTc^T^ °" LB » P ' a,e mediUm C ° ntainin 9 01 m ^ ^P' 
37-C overnight in a 96-we.l titer plSS^^,?^!?* ' h 9 C ° ,0nieS Stati ° nari,y Cultured * 

(4) Determination of nucleotide sequence 
(4- 1 ) Preparation of template 

tPo,e T S~o^ 

PCR method using the library prepared in The above^ Sh ° t9Un meth0d Was P re P ared * the 

Sed^^^^ 

acjp-ato •£SJ!S^^ a ^ P'f (manufactured by Tokken) into a 96-wel. re- 

Takara Shuzo)) at 0.08 ml pe> each weH Ten pcZ^ T"" ^f™ Ex Taq <™>"'acturcd by 

(DNA Research, 5: 1 -9 (1 * 9B)) using Gen^o PCR ^^l^T aCC t ° rdanc * with the P"*»l by Makino « a? 
inserted fragment. 9 ^ et)eAmp PCR Svste m 9700 (manufactured by PE Biosystems) to amplify the 

Sy XZZtt^ ">* • " P"^ing a PCR production (manufac- 

I0351J Some nucleotide sequen«^52lZ2?^ ? 1,16 temP ' a,e ,n the ^^"^"3 action. 

quences were determined using a double-stranded DNA plasmid as a template. 
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[0352] The double-stranded DNA plasmid as the template was obtained by the following method. 
[0353] The clone derived from the whole genome shotgun library was inoculated into a 24- or 96-well plate containing 
a 2x YTmedium (1 6 g/l bactotrypton ; 1 0 g/l yeast extract, 5 g/l sodium chloride, pH 7.0) containing 0.05 mg/ml ampicillin 
at 1 .5 ml per each well and then cultured under shaking at 37°C overnight. 

[0354] The double-stranded DNA plasmid was prepared from the culturing solution using an automatic plasmid pre- 
paring machine, KURABO PI-50 (manufactured by Kurabo Industries) or a multiscreen (manufactured by Millipore) in 
accordance with the protocol provided by the manufacturer. 

[0355] To purify the double-stranded DNA plasmid using the multiscreen, Biomek 2000 (manufactured by Beckman 
Coulter) or the like was employed. 

[0356] The thus obtained double-stranded DNA plasmid was dissolved in water to give a concentration of about 0 1 
mg/ml and used as the template in sequencing. 

(4-2) Sequencing reaction 



[0357] To 6 \x\ of a solution of ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured 
by PE Biosystems), an M13 regular direction primer (M13-21) or an M13 reverse direction primer (M13REV) (DNA 
Research, 5: 1-9 (1998) and the template prepared in the above (4-1) (the PCR product or the plasmid) were added 
to give 10 ^il of a sequencing reaction solution. The primers and the templates were used in an amount of 1 .6 pmol 
and an amount of 50 Lo 200 ng, respectively. 

20 [0358] Dye terminator sequencing reaction of 45 cycles was carried out with GeneAmp PCR System 9700 (manu- 
factured by PE Biosystems) using the reaction solution. The cycle parameter was determined in accordance with the 
manufacturer's instruction accompanying ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit. The 
sample was punfied using Multiscreen HV plate (manufactured by Millipore) according to the manufacture's instruc- 
tions. The thus purified reaction product was precipitated with cthanol, followed by drying, and then stored in the dark 

25 a t -3o°C. 

[0359] The dry reaction product was analyzed by ABI PRISM 377 DNA Sequencer and ABI PRISM 3700 DNA An- 
alyzer (both manufactured by PE Biosystems) each in accordance with the manufacture's instructions. 
[0360] The data of about 50,000 sequences in total (i.e., about 42,000 sequences obtained using 377 DNA Sequenc- 
er and about 8,000 reactions obtained by 3700 DNA Analyser) were transferred to a server (Alpha Server 41 00: man- 
ufactured by COMPAQ) and stored. The data of these about 50,000 sequences corresponded to 6 times as much as 
the genome size. 

(5) Assembly 



[0361] All operations were carried out on the basis of UNIX platform. The analytical data were output in Macintosh 
platform using X Window System. The base call was carried out using phred (The University of Washington). The 
vector sequence data was deleted using SPS Crossmatch (manufactured by Southwest Parallel Software). The as- 
sembly was carried out using SPS phrap (manufactured by Southwest Parallel Software; a high-speed version of phrap 
(The University of Washington)). The contig obtained by the assembly was analyzed using a graphical editor, consed 
(The University of Washington). A series of the operations from the base call to the assembly were carried out simul- 
taneously using a script phredPhrap attached to consed. 

(6) Determination of nucleotide sequence in gap part 

[0362] Each cosmid in the cosmid library constructed in the above (3) was prepared by a method similar to the 
preparation of the double-stranded DNA plasmid described in the above (4-1 ). The nucleotide sequence at the end of 
the inserted fragment or the cosmid was determined by using ABI PRISM BigDye Terminator Cycle Sequencing Ready 
Reaction Kit (manufactured by PE Biosystems) according to the manufactured instructions. 

[0363] About 800 cosmid clones were sequenced at both ends to search a nucleotide sequence in the contig derived 
from the shotgun sequencing obtained in the above (5) coincident with the sequence. Thus, the linkage between re- 
spective cosmid clones and respective contigs were determined and mutual alignment was carried out. Furthermore, 
^ rCSults WCr ° comparcd with tnc Physical map of Coryncbactcrium glutamicum ATCC 13032 (MoL Gen. Genet, 
252: 255-265 (1996) to carrying out mapping between the cosmids and the contigs. 

[0364] The sequence in the region which was not covered with the contigs was determined by the following method. 
[0365] Clones containing sequences positioned at the ends of contigs were selected. Among these clones, about 
1 ,000 clones wherein only one end of the inserted fragment had been determined were selected and the sequence at 
the opposite end of the inserted fragment was determined. A shotgun library clone or a cosmid clone containing the 
sequences at the respective ends of the inserted fragment in two contigs was identified, the full nucleotide sequence 
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end sequences at the two conls ^Tpr^lZ ^n^ P ™ erS ^P'^ntary to the 

sequencing was performed by Se p*™ vatoLZl!* " ^ 93P P3,t W8S amplified «* PCR Then, 

shotgun method in which the LaLnce* ^ 9 ■ 9 the amp ' ified DNA fra 9 ment as a ter "P'«te or by the 

were sy nthesteed usjng AUTOFINISH function and 
walking method to improve the sequence prS ?%£!£Tt the sequence was determined by the primer 
Corynebacterium glutamicum ATC^^3032 ^ratn is sbown^n SEQ Id"^.-^ S6qUenCe °' the " 

(7) Identification of ORF and presumption of its function 

Ed. ^ilSiP^ZrSZ^ 22f° ? N h° :1 ^ id6ntified aCCOrdi " a 40 the "«*•» 

GeneMark - 

Ld 1 " °" d3ta thUS ° btained ' ORFs in the nuc| s°«*> sequence represented by sl Q ,d NO:1 were iden- 

Prot, PIR or Genpept dLba.rc6nXS bv nrotl !f " 9 protein - encodin 9 domains derived from Swiss- 
Search (manufactured by C^S^Z^tZ^? dmtml U ° m GenBank database = Frame 
against an amino acid databas^co^ 

^abaso instituted by protein cncodina^ D 08 de " Ved fr ° m Swiss - p '<*. «R or Genpept 

- o^rs^::^ 

ashaving^h/hes^^^^ 

the amino acid sequent ^T^l^Z seq " ences of tne ORFs as the results of the homology search in 
names of the gene of SsZe^ Tf ^"^ SOftWare Frame Search Manufactured by Smpuqen" 

* compared J plTCSSr ^ 9 "~' ^ the matChed ^ 'Entities and analogies 
firmed via the .kni^nlSt^^ 1,16 coires P°" d ^ Prions were 

1 • Also, the positions of n^ct^StiS^^^^S^ £T ? S6qUenCe ° f SEQ ,D NO: 

'v e r nces ' and *• ,,ke > * tS^^rs: ssssir amp,e - nbosoma ' rna 9enes - ^ rna 

4o 73] ^showstheposflonsof^^ 
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bicyclomycin resistance protein 


sodium dependent phosphate pump 


phenazine biosynthesis protein 




ABC transporter 


ABC transporter ATP-binding protein 
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sodium/gtutamate sympori earner 
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monooxygenase or oxidoreductase 
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Function 


hypothetical protein 
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ABC transporter 


ABC transporter | 


metabolite transport protein homolog 
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succinyl-diaminopimelate 
desuccinylase 










dehydrin-hke protein 


maltose/maltodextrin transport ATP- 
binding protein 
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Function 


transcriptional regulator 


i 






hypothetical protein 


glucan 1,4-alpha-glucosidase 
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Corynebacterium diphtheriae 
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Streptomyces coelicolor A3(2) 
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methyltransferase 


nodulin 21 -related protein 
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transposon tn501 resolvase 
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transposase protein fragment 
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l 
I 

i 

! 
i 

i 


glyceraldehyde-3-phosphate 
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copper/potassium-transporting 
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Streptomyces coelicolor A3(2) 
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Corynebacterium glutamicum 
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CO 
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hypothetical protein 


hypothetical protein 


partitioning or sporulation protein 


glucose inhibited division protein B I 


hypothetical membrane protein 


ribonuclease P protein component 


SOS ribosomal protein L34 






L-aspartate-alpha-decarboxylase 
precursor 


2-isopropylmalate synthase 


hypothetical protein 


aspartate-semialdehyde 
dehydrogenase 


3-dehydroquinase 
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Table 1 (continued) 


Homologous gene 




Chlamydomonas reinhardtii thi2 


Bacillus subtilis cwlB 






Mycobacterium tuberculosis 
H37Rv Rv3916c 


Pseudomonas putida ygi2 


Mycobacterium tuberculosis 
H37Rv parB 


Escherichia coli K12 gidB 


Mycobacterium tuberculosis 
H37Rv Rv3921c 


Bacillus subtilis rnpA 


Mycobacterium avium rpmH 






Corynebacterium glutamicum 
panD 


Corynebacterium glutamicum 
ATCC 13032 leuA 


Corynebacterium glutamicum 
(Brevibacterium flavum) ATCC 
13032 orfX 


Corynebacterium glutamicum 
asd 


Corynebacterium glutamicum 
AS019 aroD 


35 
40 




db Match 




LU 
QT 
_1 
X 

o 

I 

CN 

X 


CWLB_BACSU 






D70851 


YGI2_PSEPU 


r> 

CL 
LU 
CO 
0_ 

> 


GIDB ECOLI 


A70852 


= 

CO 

u 

<• 

z 

or 


MAU19185J 






i 

co 

CO 
r— 
t— 
LL 
< 


LEU1_CORGL 


— 1 
O 
or 
O 
0 

1 

LU 
-J 


DHAS_CORGL 


AF124518J 








CL 
t/i 


CL 






Q. 


CL 

tn 


CL 

VI 


CL 

(A 


o. 


CL 

tfl 


o. 

CJ) 






cL 

O) 


CL 

(/> 


CL 

co 


CL 


CL 

cn 








1185 


CN 
CO 


1242 


111 \ 


1041 


CO 
CO 


1152 


r-- 
co 

CO 


cn 

CO 
CO 


to 
Oi 


O) 
O) 
CO 


CO 
CO 
CO 


cn 

CN 


CN 
CN 
CN 


406 


1848 


LO 
CO 
CN 


1032 




45 




Terminal 
(nt) 


3300119 


3301729 


3302996 


3301989 


3304475 


3302999 


3303636 


3304835 


3305864 


3306682 


3307971 


3308412 


3309321 


3308822 


147573 


266154 


T 

T-* 

CO 

CO 

to 

CN 


271691 


446521 


50 




Initial 
(nt) 


3301303 


3301358 


3301755 


3302765 


3303435 


3303616 


3304787 


3305671 


3306532 


3307632 


3308369 


3308747 


3309028 


3309043 ! 


147980 


T" 
O 

O 

GO 

to 

| CN 


269068 


— 1 

270660 


446075 






CO 2 « 


6918 


6919 


i 

6920 


6921 


6922 


6923 


6924 


6925 


6926 


6927 


6928 


6929 


6930 


|693ll 


6932 


6933 


CO 

cn 

CO 


6935 


6936 






SEQ 
NO 
(ONA) 


3418 


3419 


3420 


3421 


3422 


CO 
CN 
XT 
CO 


CN 
CO 


3425 


3426 


3427 


CO 

CN 

CO 


3429 


3430 


3431 


3432I 


_j 

3433 


3434 


3435 


3436 
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Function 


elongation factor Tu 


preprotein translocase secY subuit 


isocitrate dehydrogenase 
(oxalosucctnatedecarboxylase) 


acyl-CoA carboxylase or biotin- 
binding protein 


citrate synthase 


putative binding protein or peptidyl- 
prolyl cis-trans isomerase 


glycine betatne transporter 


hypothetical membrane protein 


L-lysine permease 


aromatic amino acid permease 


hypothetical protein 


succinyl diamtnopimelate 
desuccmylase 


proline transport system 


arginyl-tRNA synthetase 
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Table 1 (conlinued) 


Homologous gene 


Corynebacterium glutamicum 
ATCC 13059 tuf 


Corynebacterium glutamicum 
(Brevibacterium flavum) MJ233 
secY 


Corynebacterium glutamicum 
ATCC 13032 icd 


Corynebacterium glutamicum 
ATCC 13032 accBC 


Corynebacterium glutamicum 
ATCC 13032 gltA 


Corynebacterium glutamicum 
ATCC 13032 fkbA 


Corynebacterium glutamicum 
ATCC 13032 betP 


Corynebacterium glutamicum 
ATCC 13032 orf2 


Corynebacterium glutamicum 
ATCC 13032 lysl 


Corynebacterium glutamicum 
ATCC 13032 aroP 


Corynebacterium glutamicum 
ATCC 13032orf3 


Corynebacterium glutamicum 
ATCC 13032dapE 


Corynebacterium glutamicum 
ATCC 13032 putP 


Corynebacterium glutamicum 
AS019 ATCC 13059 argS 
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NAOH dehydrogenase 


phosphoribosyl-ATP- 
pyrophosphohydrolase 


ornithine-cyclodecarboxylase 


ammonium uptake protein, high 
affinity 


protein-export membrane protein 
secG 


phosphoenolpyruvate carboxylase 


chorismate synthase (5- 

enolpyruvylsnikimate-3-phosphate 

phospholyase) 


restriction endonuctease 


sigma factor or RNA polymerase 
transcription factor 


glutamate-binding protein 


recA protein 


dihydrodipicolinate synthase 


dihydrodipicolinate reductase 


L-malate dehydrogenase (acceptor) 
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Homologous gene 


Corynebacterium glutamicum 
ATCC 13032 ndh 


Corynebacterium glutamicum 
AS019hisE 


Corynebacterium glutamicum 
ATCC 13032 ocd 


Corynebacterium glutamicum 
ATCC 13032 amt 


Corynebacterium glutamicum 
ATCC 13032 secG 


Corynebacterium glutamicum 
ATCC 13032 ppc 


Corynebacterium glutamicum 
AS019 aroC 


Corynebacterium glutamicum 
ATCC 13032 cglllR 


Corynebacterium glutamicum 
ATCC 13869 sigB 


Corynebacterium glutamicum 
ATCC 13032 gluB 


Corynebacterium glutamicum ! 
AS019recA 


Corynebacterium glutamicum 
(Brevibacterium lactofermentum) 
ATCC 13869 dapA 


Corynebacterium glutamicum 
(Brevibacterium lactofermentum) 
ATCC 13869 dapB 


Corynebacterium glutamicum 
R127 mqo 
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Function 


uridilylyltransferase, uridilylyl- 
removing enzyme 


nitrogen regulatory protein P-ll 


ammonium transporter 


glutamate dehydrogenase (NADP+) 


pyruvate kinase 


glucokinase 


gtutamine synthetase 


threonine synthase 


ectoine/proline/gtycine betaine 
carrier 


malate synthase 


isocitrate lyase 


glutamate 5-kinase 


| cystathionine gamma-synlhase 


| ribonucleotide reductase 


glutaredoxin 
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Corynebacterium glutamicum 
ATCC 13032 glnD 


Corynebacterium glutamicum 
ATCC 13032 glnB 


Corynebacterium glutamicum 
ATCC 13032 amtP 


Corynebacterium glutamicum 
ATCC 17965 gdhA 


Corynebacterium glutamicum 
AS019 pyk 


Corynebacterium glutamicum 
ATCC 13032 glk 


Corynebacterium glutamicum 
ATCC 13032 glnA 


Corynebacterium glutamicum 
thrC 


Corynebacterium glutamicum 
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Example 2 

Determination of effective mutation site 

(1) Identification of mutation site based on the comparison of the gene nucleotide sequence of lysine-producing B-6 
strain with that of wild type strain ATCC 13032 

[0374] Corynebactehum glutamicum B-6, which is resistant to S-(2-aminoethyl)cysteine (AEC), rifampicin, strepto- 
mycin and 6-azauracil, is a lysine-producing mutant having been mutated and bred by subjecting the wild type ATCC 
13032 strain to multiple rounds of random mutagenesis with a mutagen, N-methyt-N' -nitro-N-nitrosoguanidine (NTG) 
and screening (AppL Microbiol. Biotechnol.. 32. 269-273 (1989)). First, the nucleotide sequences of genes derived 
from the B-6 strain and considered to relate to the lysine production were determined by a method similar to the above. 
The genes relating to the lysine production include lysE and lysG which are lysine-excreting genes; ddh, dapA, horn 
and lysC (encoding diaminopimelate dehydrogenase, dihydropicolinate synthase, homoserine dehydrogenase and 
aspartokinase, respectively) which are lysine-biosynthetic genes; and pyc and zwf (encoding pyruvate carboxylase 
and glucose-6-phosphate dehydrogenase, respectively) which are glucose-metabolizing genes. The nucleotide se- 
quences of the genes derived from the production strain were compared with the corresponding nucleotide sequences 
of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and analyzed. As a result, mutation points 
were observed in many genes. For example, no mutation site was observed in lysE, lysG. ddh, dapA, and the like, 
whereas amino acid replacement mutations were found in horn, lysC, pyc, zwf, and the like. Among these mutation 
points, those which are considered to contribute to the production were extracted on the basis of known biochemical 
or genetic information. Among the mutation points thus extracted, a mutation, Val59Ala, in horn and a mutation, 
Pro458Ser, in pyc were evaluated whether or not the mutations were effective according to the following method. 

(2) Evaluation of mutation, Vai59Ala. in horn and mutation. Pro458Ser, in pyc 

[0375] It is known that a mutation in horn inducing requirement or partial requirement for homoserine imparts lysine 
productivity to a wild type strain (Amino Acid Fermentation, ed. by Hiroshi Aida et ai, Japan Scientific Societies Press). 
However, the relationship between the mutation, Val59Ala, in horn and lysine production is not known. It can be ex- 
amined whether or not the mutation, Val59Ala, in horn is an effective mutation by introducing the mutation to the wild 
type strain and examining the lysine productivity of the resulting strain. On the other hand, it can be examined whether 
or not the mutation, Pro458Ser, in pyc is effective by introducing this mutation into a lysine-producing strain which has 
a deregulated lysine-bioxynthetic pathway and is free from the pyc mutation, and comparing the lysine productivity of 
the resulting strain with the parent strain. As such a lysine-producing bacterium. No. 58 strain (FERM BP-7134) was 
selected (hereinafter referred to the "lysine-producing No. 58 strain" or the "No. 58 strain"). Based on the above, it was 
determined that the mutation, Val59Ala, in horn and the mutation, Pro458Ser, in pyc were introduced into the wild type 
strain of Corynebactehum glutamicum ATCC 13032 (hereinafter referred to as the "wild type ATCC 13032 strain" or 
the "ATCC 13032 strain") and the lysine-producing No. 58 strain, respectively, using the gene replacement method. A 
plasmid vector pCES30 for the gene replacement for the introduction was constructed by the following method. 
[0376] A plasmid vector pCE53 having a kanamycin -resistant gene and being capable of autonomously replicating 
in Coryneform bacteria (Mol. Gen. Genet, 19& 175-178 (1984)) and a plasmid pMOB3 (ATCC 77282) containing a 
levansucrase gene (sacB) of Bacillus subtilis (Molecular Microbiology, 6: 1195-1204 (1992)) were each digested with 
Psrt. Then, after agarose gel electrophoresis, a pCE53 fragment and a 2.6 kb DNA fragment containing sacB were 
each extracted and purified using GENECLEAN Kit (manufactured by BIO 101 ). The pCE53 fragment and the 2.6 kb 
DNA fragment were ligated using Ligation Kit ver. 2 (manufactured by Takara Shuzo), introduced into the ATCC 1 3032 
strain by the electroporation method (FEMS Microbiology Letters, 65: 299 (1 989)), and cultured on BYG agar medium 
(medium prepared by adding 1 0 g of glucose, 20 g of peptone (manufactured by Kyokuto Pharmaceutical), 5 g of yeast 
extract (manufactured by Difco), and 16 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its 
pH to 7.2) containing 25 u,g/ml kanamycin at 30°C for 2 days to obtain a transformant acquiring kanamycin-resistance. 
As a result of digestion analysis with restriction enzymes, it was confirmed that a plasmid extracted from the resulting 
transformant by the alkali SDS method had a structure in which the 2.6 kb DNA fragment had been inserted into the 
Psti site of pCE53. This plasmid was named pCES30. 

[0377] Next, two genes having a mutation point, horn and pyc, were amplified by PCR, and inserted into pCES30 
according to the TA cloning method (Bio Experiment Illustrated vol. 3, published by Shujunsha). Specifically, pCES30 
was digested with SamHI (manufactured by Takara Shuzo), subjected to an agarose gel electrophoresis, and extracted 
and purified using GENECLEAN Kit (manufactured by BI0 101). The both ends of the resulting pCES30 fragment were 
blunted with DNA Blunting Kit (manufactured by Takara Shuzo) according to the attached protocol. The blunt-ended 
pCES30 fragment was concentrated by extraction with phenol/chloroform and precipitation with ethanol, and allowed 



EP 1 108 790 A2 

to react in the presence of Taq polymerase (manufactured by Roche Diagnostics) and dTTP at 70°C for 2 hours so 
that a nucleotide, thymine (T), was added to the 3'-end to prepare a T vector of pCES30. 
[0378] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the method 
of Saito et at. (Biochem. Biophys. Acta, 72. 619 (1963)). Using the chromosomal DNA as a template, PCR was carried 
5 out with Pfu turbo DNA polymerase (manufactured by Stratagene). In the mutated horn gene, the DNAs having the 
nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were used as the primer set. In the mutated pyc 
gene, the DNAs having the nucleotide sequences represented by SEQ ID NOS:7004 and 7005 were used as the primer 
set. The resulting PCR product was subjected to agarose gel electrophoresis, and extracted and purified using GENE- 
GLEAN Kit (manufactured by BIO 1 01 ). Then, the PCR product was allowed to react in the presence of Taq polymerase 
w (manufactured by Roche Diagnostics) and dATP at 72°C for 1 0 minutes so that a nucleotide, adenine (A), was added 
to the 3'-end, 

[0379] The above pCES30 T vector fragment and the mutated horn gene (1 .7 kb) or mutated pyc gene (3.6 kb) to 
which the nucleotide A had been added of the PCR product were concentrated by extraction with phenol/chloroform 
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the 
15 ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 ug/ml 
kanamycin at 30°C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 u.g/ml kanamycin. and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was 
confirmed that the plasmid had a structure in which the 1 .7 kb or 3.6 kb DNA fragment had been inserted into pCES30. 
20 The plasmids thus constructed were named respectively pChom59 and pCpyc458. 

[0380] The introduction of the mutations to the wild type ATCC 13032 strain and the lysine-producing No. 58 strain 
according to the gene replacement method was carried out according to the following method. Specifically, pChom59 
and pCpyc458 were introduced to the ATCC 13032 strain and the No. 58 strain, respectively, and strains in which the 
piasmid is integrated into the chromosomal DNA by homologous recombination were selected using the method of 
25 ikeda et al. (Microbiology 144: 1863 (1998)). Then, the stains in which the second homologous recombination was 
carried out were selected by a selection method, making use of the fact that the Bacillus subtilis levansucrase encoded 
by pCES30 produced a suicidal substance (J. of Bacterid., 174: 5462 (1992)), Among the selected strains, strains in 
which the wild type horn and pyc genes possessed by the ATCC 1 3032 strain and the No. 58 strain were replaced with 
the mutated horn and pyc genes, respectively, were isolated. The method is specifically explained below. 
30 [0381] One strain was selected from the transformants containing the plasmid, pChom59 or pCpyc458, and the 
selected strain was cultured in BYG medium containing 20 ug/mi kanamycin, and pCG11 (Japanese Published Exam- 
ined Patent Application No. 91827/94) was introduced thereinto by the electroporation method. pCG11 is a plasmid 
vector having a spectinomycin-resistant gene and a replication origin which is the same as pCE53. After introduction 
of the pCGII, the strain was cultured on BYG agar medium containing 20 ug/ml kanamycin and 1 00 ug/ml spectinomycin 
35 at 30°C for 2 days to obtain both the kanamycin- and spectinomycin-resistant transformant. The chromosome of one 
strain of these transformants was examined by the Southern blotting hybridization according to the method reported 
by Ikeda et al. (Microbiology, 144: 1863 (1998)). As a result, it was confirmed that pChom59 or pCpyc458 had been 
integrated into the chromosome by the homologous recombination of the Cambell type. In such a strain, the wild type 
and mutated horn or pyc genes are present closely on the chromosome, and the second homologous recombination 
40 is liable to arise therebetween. 

[0382] Each of these transformants (having been recombined once) was spread on Sue agar medium (medium 
prepared by adding 100 g of sucrose, 7 g of meat extract, 10 g of peptone, 3 g of sodium chloride, 5 g of yeast extract 
(manufactured by Difco), and 18 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its pH 7.2) 
and cultured at 30°C for a day. Then the colonies thus growing were selected in each case. Since a strain in which the 
45 sacB gene is present converts sucrose into a suicide substrate, it cannot grow in this medium (J. Bacterioi, 1 74: 5462 
(1 992)). On the other hand, a strain in which the sacB gene was deleted due to the second homologous recombination 
between the wild type and the mutated horn or pyc genes positioned closely to each other forms no suicide substrate 
and, therefore, can grow in this medium. In the homologous recombination, either the wild type gene or the mutated 
gene is deleted together with the sacB gene. When the wild type is deleted together with the sacB gene, the gene 
so replacement into the mutated type arises. 

[0383] Chromosomal DNA of each the thus obtained second recombinants was prepared by the above method of 
Saito et al. PCR was carried out using Pfu turbo DNA polymerase (manufactured by Stratagene) and the attached 
buffer. In the horn gene, DNAs having the nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were 
used as the primer set. Also, in the pyc gene was used, DNAs having the nucleotide sequences represented by SEQ 
55 id NOS:7004 and 7005 were used as the primer set. The nucleotide sequences of the PCR products were determined 
by the conventional method so that it was judged whether the horn or pyc gene of the second recombinant was a wild 
type or a mutant. As a result, the second recombinant which were called HD-1 and No. 58pyc were target strains having 
the mutated horn gene and pyc gene, respectively. 
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(3) Lysine production test of HD-1 and No. 58pyc strains 

[0384] The HD-1 strain (strain obtained by incorporating the mutation, Val59Ala, in the horn gene into the ATCC 
13032 strain) and the No. 58pyc strain (strain obtained by incorporating the mutation, Pro458Ser, in the pyc gene into 

5 the lysine-producing No. 58 strain) were subjected to a culture test in a 5 I jar fermenter by using the ATCC 13032 
strain and the lysine-producing No. 58 strain respectively as a control. Thus lysine production was examined. 
[0385] After culturing on BYG agar medium at 30°C for 24 hours, each strain was inoculated into 250 ml of a seed 
medium (medium prepared by adding 50 g of sucrose, 40 g of corn steep liquor, 8.3 g of ammonium sulfate, 1 g of 
urea, 2 g of potassium dihydrogenphosphate, 0.83 g of magnesium sulfate heptahydrate, 10 mg of iron sulfate hep- 

10 tahydrate, 1 mg of copper sulfate pentahydrate, 1 0 mg of zinc sulfate heptahydrate, 1 0 mg of (3-alanine, 5 mg of nicotinic 
acid, 1 .5 mg of thiamin hydrochloride, and 0.5 mg of biotin to 1 liter of water, and adjusting its pH to 7.2, then to which 
30 g of calcium carbonate had been added) contained in a 2 1 buffle-attached Erlenmeyer flask and cultured therein 
at 30°C for 1 2 to 1 6 hours. A total amount of the seed culturing medium was inoculated into 1 ,400 ml of a main culture 
medium (medium prepared by adding 60 g of glucose, 20 g of com steep liquor, 25 g of ammonium chloride, 2.5 g of 

15 potassium dihydrogenphosphate, 0.75 g of magnesium sulfate heptahydrate, 50 mg of iron sulfate heptahydrate. 1 3 
mg of manganese sulfate pentahydrate, 50 mg of calcium chloride, 6.3 mg of copper sulfate pentahydrate, 1 .3 mg of 
zinc sulfate heptahydrate, 5 mg of nickel chloride hexahydrate, 1 .3 mg of cobalt chloride hexahydrate, 1 .3 mg of am- 
monium molybdenate tetrahydrate, 14 mg of nicotinic acid, 23 mg of ^-alanine, 7 mg of thiamin hydrochloride, and 
0.42 mg of biotin to 1 liter of water) contained in a 5 1 jar fermenter and cultured therein at 32°C, 1 wm and 800 rpm 

20 while controlling the pH to 7.0 with aqueous ammonia. When glucose in the medium had been consumed, a glucose 
feeding solution (medium prepared by adding 400 g glucose and 45 g of ammonium chloride to 1 liter of water) was 
continuously added. The addition of feeding solution was carried out at a controlled speed so as to maintain the dis- 
solved oxygen concentration within a range of 0.5 to 3 ppm. After culturing for 29 hours, the culture was terminated. 
The cells were separated from the culture medium by centrifugation and then L-lysine hydrochloride in the supernatant 

25 was quantified by high performance liquid chromatography (HPLC). The results are shown in Table 2 below. 



Table 2 


Strain 


L-Lysine hydrochloride yield (g/l) 


ATCC 13032 


0 


HD-1 


8 


No. 58 


45 


No. 58pyc 


51 



35 [0386] As is apparent from the results shown in Table 2, the lysine productivity was improved by introducing the 
mutation, Val59Ala, in the horn gene or the mutation, Pro458Ser, in the pyc gene. Accordingly, it was found that the 
mutations are both effective mutations relating to the production of lysine. Strain, AHP-3, in which the mutation, 
Vai59Ala, in the horn gene and the mutation, Pro458Ser, in the pyc gene have been introduced into the wild type ATCC 
13032 strain together with the mutation, Thr331lle in the tysC gene has been deposited on December 5, 2000, in 

40 National Institute of Bioscience and Human Technology, Agency of Industrial Science and Technology (Higashi 1-1-3, 
Tsukuba-shi, Ibaraki, Japan) as FERM BP-7382. 

Example 3 

45 Reconstruction of lysine-producing strain based on genome information 

[0387] The lysine-producing mutant B-6 strain (Appl. Microbiol. Biotechnol., 32. 269-273 (1989)), which has been 
constructed by multiple round random mutagenesis with NTG and screening from the wild type ATCC 1 3032 strain, 
produces a remarkably large amount of lysine hydrochloride when cultured in a jar at 32°C using glucose as a carbon 
50 source. However, since the fermentation period is long, the production rate is less than 2. 1 g/l/h. Breeding to reconstitute 
only effective mutations relating to the production of lysine among the estimated at least 300 mutations introduced into 
the B-6 strain in the wild type ATCC 13032 strain was performed. 

(1 ) Identification of mutation point and effective mutation by comparing the gene nucleotide sequence of the B-6 strain 
55 with that of the ATCC 13032 strain 

[0388] As described above, the nucleotide sequences of genes derived from the B-6 strain were compared with the 
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corresponding nucleotide sequences of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and 
analyzed to identify many mutation points accumulated in the chromosome of the B-6 strain. Among these, a mutation, 
Val591Ala, in horn, a mutation, Thr311lle, in lysC, a mutation, Pro458Ser, in pyc and a mutation, Aia213Thr, in zwf 
were specified as effective mutations relating to the production of lysine. Breeding to reconstitute the 4 mutations in 
5 the wild type strain and for constructing of an industrially important lysine-producing strain was carried out according 
to the method shown below. 



(2) Construction of plasmid for gene replacement having mutated gene 

w [0389] The plasmid for gene replacement, pChom59, having the mutated horn gene and the plasmid for gene re- 
placement, pCpyc458, having the mutated pyc gene were prepared in the above Example 2(2). Plasmids for gene 
replacement having the mutated lysC and zwf were produced as described below. 

[0390] The lysC and zwf having mutation points were amplified by PCR, and inserted into a plasmid for gene re- 
placement, pCES30, according to the TA cloning method described in Example 2(2) (Bio Experiment Illustrated, Vol. 3). 

15 [0391] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the above 
method of Saito etai. Using the chromosomal DNA as a template, PCR was carried out with Pfu turbo DNA polymerase 
(manufactured by Stratagene). In the mutated lysC gene, the DNAs having the nucleotide sequences represented by 
SEQ ID NOS:7006 and 7007 were used as the primer set. In the mutated zwf gene, the DNAs having the nucleotide 
sequences represented by SEQ ID NOS:7008 and 7009 as the primer set. The resulting PCR product was subjected 

20 to agarose gel electrophoresis, and extracted and purified using GENEGLEAN Kit (manufactured by BIO 1 01 ). Then, 
the PCR product was allowed to react in the presence of Taq DNA polymerase (manufactured by Roche Diagnostics) 
and dATP at 72°C for 10 minutes so that a nucleotide, adenine (A), was added to the 3'-end. 

[0392] The above pCES30 T vector fragment and the mutated lysC gene (1 .5 kb) or mutated zwf gene (2.3 kb) to 
which the nucleotide A had been added of the PCR product were concentrated by extraction with phenol/chloroform 

25 and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the 
ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 u.g/ml 
kanamycin at 30° C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was 
cultured overnight in BYG liquid medium containing 25 jig/ml kanamycin, and a plasmid was extracted from the culturing 
solution medium according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was 

30 confirmed that the plasmid had a structure in which the 1 .5 kb or 2.3 kb DNA fragment had been inserted into pCES30. 
The plasmids thus constructed were named respectively pClysC311 and pCzwf213. 

(3) Introduction of mutation, Thr311lle, in lysC into one point mutant HD-1 

35 [0393] Since the one mutation point mutant HD-1 in which the mutation, Val59Ala, in horn was introduced into the 
wild type ATCC 13032 strain had been obtained in Example 2(2), the mutation, Thr311lle, in lysC was introduced into 
the HD-1 strain using pClysC311 produced in the above (2) according to the gene replacement method described in 
Example 2(2). PCR was carried out using chromosomal DNA of the resulting strain and, as the primer set, DNAs having 
the nucleotide sequences represented by SEQ ID NOS:7006 and 7007 in the same manner as in Example 2(2). As a 

to result of the fact that the nucleotide sequence of the PCR product was determined in the usual manner, it was confirmed 
that the strain which was named AHD-2 was a two point mutant having the mutated lysC gene in addition to the mutated 
horn gene. 

(4) Introduction of mutation, Pro458Ser, in pyc into two point mutant AHD-2 

45 

[0394] The mutation, Pro458Ser, in pyc was introduced into the AHD-2 strain using the pCpyc458 produced in Ex- 
ample 2(2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal 
DNA of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID 
NOS:7004 and 7005 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of 
so the PCR product was determined in the usual manner, it was confirmed that the strain which was named AHD-3 was 
a three point mutant having the mutated pyc gene in addition to the mutated horn gene and lysC gene. 



(5) Introduction of mutation, Ala213Thr, in zwf into three point mutant AHP-3 



55 [0395] The mutation, Ala213Thr, in zwf was introduced into the AHP-3 strain using the pCzwf458 produced in the 
above (2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal DNA 
of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID NOS: 
7008 and 7009 in the same manner as in Example 2(2). As a result of the fact that the nucleotide sequence of the PCR 
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product was determined in the usual manner, it was confirmed that the strain which was named APZ-4 was a four point 
mutant having the mutated zwf gene in addition to the mutated horn gene, ly$C gene and pyc gene. 



(6) Lysine production test on HD-1, AHD-2, AHP-3 and APZ-4 strains 

5 . 

[0396] The HD-1, AHD-2, AHP-3 and APZ-4 strains obtained above were subjected to a culture test in a 5 I jar 
fermenter in accordance with the method of Example 2(3). 
[0397] Table 3 shows the results. 

w Table 3 



Strain 


L-lysine hydrochloride (g/l) 


Productivity (g/l/h) 


HD-1 


8 


0.3 


AHD-2 


73 


2.5 


AHP-3 


80 


2.8 


APZ-4 


86 


3.0 



[0398] Since the lysine-producing mutant B-6 strain which has been bred based on the random mutation and selection 
shows a productivity of less than 2.1 g/l/h, the APZ-4 strain showing a high productivity of 3.0 g/!/h is useful in industry. 

20 

(7) Lysine fermentation by APZ-4 strain at high temperature 

[0399] The APZ-4 strain, which had been reconstructed by introducing 4 effective mutations into the wild type strain, 
was subjected to the culturing test in a 5 1 jar fermenter in the same manner as in Example 2(3), except that the cuituring 
25 temperature was changed to 40°C. 

[0400] The results are shown in Table 4. 



Table 4 



Temperature (°C) 


L-Lysine hydrochloride (g/l) 


Productivity (g/l/h) 


32 


86 


3.0 


40 


95 


3.3 



[0401] As is apparent from the results shown in Table 4, the lysine hydrochloride titer and productivity in culturing at 
35 a high temperature of 40°C comparable to those at 32°C were obtained. In the mutated and bred lysine-producing B- 
6 strain constructed by repeating random mutation and selection, the growth and the lysine productivity are lowered 
at temperatures exceeding 34° C so that lysine fermentation cannot be carried out, whereas lysine fermentation can 
be carried out using the APZ-4 strain at a high temperature of 40°C so that the load of cooling is greatly reduced and 
it is industrially useful. The lysine fermentation at high temperatures can be achieved by reflecting the high temperature 
40 adaptability inherently possessed by the wild type strain on, the APZ-4 strain. 

[0402] As demonstrated in the reconstruction of the lysine-producing strain, the present invention provides a novel 
breeding method effective for eliminating the problems in the conventional mutants and acquiring industrially advan- 
tageous strains. This methodology which reconstitutes the production strain by reconstituting the effective mutation is 
an approach which is efficiently carried out using the nucleotide sequence information of the genome disclosed in the 
45 present invention, and its effectiveness was found for the first time in the present invention. 

Example 4 

Production of DNA microarray and use thereof 

50 

[0403] A DNA microarray was produced based on the nucleotide sequence information of the ORF deduced from 
the full nucleotide sequences of Corynebacterium glutamicum ATCC 1 3032 using software, and genes of which ex- 
pression is fluctuated depending on the carbon source during culturing were searched. 

55 (1 ) Production of DNA microarray 



[0404] Chromosomal DNA was prepared from Corynebacterium glutamicum ATCC 1 3032 by the method of Saito et 



EP 1 108 790 A2 



ai ( Biochem. Biophys. Acta, 72. 619 (1963)). Based on 24 genes having the nucleotide sequences represented by 
SEQ ID NOS:207, 3433, 281, 3435, 3439, 765, 3445. 1226, 1229, 3448, 3451, 3453, 3455, 1743. 3470, 2132, 3476. 
3477, 3485, 3488, 3489, 3494, 3496, and 3497 from the ORFs shown in Table 1 deduced from the full genome nucle- 
otide sequence of Corynebacterium giutamicum ATCC 13032 using software and the nucleotide sequence of rabbit 
5 globin gene (GenBank Accession No. V00882) used as an internal standard, oligo DNA primers for PCR amplification 
represented by SEQ ID NOS:7010 to 7059 targeting the nucleotide sequences of the genes were synthesized in a 
usual manner. 

[0405] As the oligo DNA primers used for the PCR, 

[0406] DNAs having the nucleotide sequence represented by SEQ ID NOS:7010 and 7011 were used for the ampli- 
fy fication of the DNA having the nucleotide sequence represented by SEQ ID NO:207, 

[0407] DNAs having the nucleotide sequence represented by SEQ ID NOS:7012 and 7013 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3433, 

[0408] DNAs having the nucleotide sequence represented by SEQ ID NOS:7014 and 7015 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:281 , 
15 [0409] DNAs having the nucleotide sequence represented by SEQ ID NOS:7016 and 7017 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3435, 

[0410] DNAs having the nucleotide sequence represented by SEQ ID NOS:7018 and 7019 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3439, 

[0411] DNAs having the nucleotide sequence represented by SEQ ID NOS:7020 and 7021 were used for the am- 
20 plification of the DNA having the nucleotide sequence represented by SEQ ID NO:765, 

[0412] DNAs having the nucleotide sequence represented by SEQ ID NOS:7022 and 7023 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3445 t 

[0413] DNAs having the nucleotide sequence represented by SEQ ID NOS:7024 and 7025 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1226, 
25 [0414] DNAs having the nucleotide sequence represented by SEQ ID NOS:7026 and 7027 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO: 1229, 

[0415] DNAs having the nucleotide sequence represented by SEQ ID NOS:7028 and 7029 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3448, 

[0416] DNAs having the nucleotide sequence represented by SEQ ID NOS:7030 and 7031 were used for the am- 
30 plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3451 , 

[0417] DNAs having the nucleotide sequence represented by SEQ ID NOS:7032 and 7033 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3453, 

[0418] DNAs having the nucleotide sequence represented by SEQ ID NOS:7034 and 7035 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3455, 
35 [0419] DNAs having the nucleotide sequence represented by SEQ ID NOS:7036 and 7037 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO: 1743, 

[0420] DNAs having the nucleotide sequence represented by SEQ ID NOS:7038 and 7039 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3470, 

[0421] DNAs having the nucleotide sequence represented by SEQ ID NOS:7040 and 7041 were used for the am- 
40 plification of the DNA having the nucleotide sequence represented by SEQ ID NO:2132, 

[0422] DNAs having the nucleotide sequence represented by SEQ ID NOS:7042 and 7043 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3476, 

[0423] DNAs having the nucleotide sequence represented by SEQ ID NOS:7044 and 7045 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3477, 
45 [0424] DNAs having the nucleotide sequence represented by SEQ ID NOS:7046 and 7047 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3485, 

[0425] DNAs having the nucleotide sequence represented by SEQ ID NOS:7048 and 7049 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3488, 

[0426] DNAs having the nucleotide sequence represented by SEQ ID NOS:7050 and 7051 were used for the am- 
50 plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3489, 

[0427] DNAs having the nucleotide sequence represented by SEQ ID NOS:7052 and 7053 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3494, 

[0428] DNAs having the nucleotide sequence represented by SEQ ID NOS:7054 and 7055 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3496, 
55 [0429] DNAs having the nucleotide sequence represented by SEQ ID NOS:7056 and 7057 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3497, and 
[0430] DNAs having the nucleotide sequence represented by SEQ ID NOS:7058 and 7059 were used for the am- 
plification of the DNA having the nucleotide sequence of the rabbit globin gene, 
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as the respective primer set. 

[0431] The PCR was carried for 30 cycles with each cycle consisting of 15 seconds at 95°C and 3 minutes at 68°C 
using a thermal cycler (GeneAmp PCR system 9600, manufactured by Perkin Elmer), TaKaRa EX-Taq (manufactured 
by Takara Shuzo), 1 00 ng of the chromosomal DNA and the buffer attached to the TaKaRa Ex-Taq reagent. In the case 

5 of the rabbit globin gene, a single-stranded cDNA which had been synthesized from rabbit globin mRNA (manufactured 
by Life Technologies) according to the manufacture's instructions using a reverse transcriptase RAV-2 (manufactured 
by Takara Shuzo). The PCR product of each gene thus amplified was subjected to agarose gel electrophoresis and 
extracted and purified using QIAquick Gel Extraction Kit (manufactured by QIAGEN). The purified PCR product was 
concentrated by precipitating it with ethanol and adjusted to a concentration of 200 ng/u.1. Each PCR product was 

10 spotted on a slide glass plate (manufactured by Matsunami Glass) having MAS coating in 2 runs using GTMASS 
SYSTEM (manufactured by Nippon Laser & Electronics Lab.) according to the manufacture's instructions. 

(2) Synthesis of fluorescence labeled cDNA 

15 [0432] The ATCC 1 3032 strain was spread on BY agar medium (medium prepared by adding 20 g of peptone (man- 
ufactured by Kyokuto Pharmaceutical), 5 g of yeast extract (manufactured by Difco), and 16 g of Bactoagar (manufac- 
tured by Difco) to in 1 liter of water and adjusting its pH to 7.2) and cultured at 30°C for 2 days. Then, the cultured 
strain was further inoculated into 5 ml of BY liquid medium and cultured at 30°C overnight. Then, the cultured strain 
was further inoculated into 30 ml of a minimum medium (medium prepared by adding 5 g of ammonium sulfate, 5 g of 

20 urea, 0.5 g of monopotassium dihydrogenphosphate, 0.5 g of dipotassium monohydrogenphosphate, 20.9 g of mor- 
pholinopropanesulfonic acid, 0.25 g of magnesium sulfate heptahydrate, 10 mg of calcium chloride dihydrate, 10 mg 
of manganese sulfate monohydrate, 10 mg of ferrous sulfate heptahydrate, 1 mg of zinc sulfate heptahydrate, 0.2 mg 
copper sulfate, and 0.2 mg biotin to 1 liter of water, and adjusting its pH to 6.5) containing 110 mmol/l glucose or 200 
mmol/l ammonium acetate, and cultured in an Erlenmyer flask at 30° to give 1 .0 of absorbance at 660 nm. After the 

25 cells were prepared by centrifuging at 4°C and 5,000 rpm for 10 minutes, total RNA was prepared from the resulting 
cells according to the method of Bormann etai ( Molecular Microbiology, & 317-326 (1992)). To avoid contamination 
with DNA, the RNA was treated with Dnasel (manufactured by Takara Shuzo) at 37°C for 30 minutes and then further 
purified using Qiagen RNeasy MiniKit (manufactured by QIAGEN) according to the manufacture's instructions. To 30 
jxg of the resulting total RNA, 0.6 |il of rabbit globin mRNA (50 ng/uJ, manufactured by Life Technologies) and 1 pi of 

30 a random 6 mer primer (500 ng/u.1, manufactured by Takara Shuzo) were added for denaturing at 65°C for 1 0 minutes, 
followed by quenching on ice. To the resulting solution, 6 |il of a buffer attached to Superscript It (manufactured by 
Lifetechnologies), 3 \i\ of 0,1 mol/l DTT, 1 .5 fil of dNTPs (25 mmol/l dATP, 25 mmol/l dCTP, 25 mmoi/l dGTP, 10 mmol/ 
I dTTP), 1 .5 \i\ of Cy5-dUTP or Cy3-dUTP (manufactured by NEN) and 2 \i\ of Superscript II were added, and allowed 
to stand at 25°C for 10 minutes and then at 42°C for 110 minutes. The RNA extracted from the cells using glucose as 

35 the carbon source and the RNA extracted from the cells using ammonium acetate were labeled with Cy5-dUTP and 
Cy3-dUTP, respectively. After the fluorescence labeling reaction, the RNA was digested by adding 1.5 \i\ of 1 mol/l 
sodium hydroxide-20 mmol/l EDTA solution and 3.0 uJ of 10% SDS solution, and allowed to stand at 65°C for 10 
minutes. The two cDNA solutions after the labeling were mixed and purified using Qiagen PCR purification Kit (man- 
ufactured by QIAGEN) according to the manufacture's instructions to give a volume of 10 uJ. 

40 

(3) Hybridization 

[0433] UltraHyb (110 \x\) (manufactured by Ambion) and the fluorescence-labeled cDNA solution (10 uJ) were mixed 
and subjected to hybridization and the subsequent washing of slide glass using GeneTAC Hybridization Station (man- 
45 ufactured by Genomic Solutions) according to the manufacture's instructions. The hybridization was carried out at 
50°C, and the washing was carried out at 25°C. 

(4) Fluorescence analysis 

50 [0434] The fluorescence amount of each DNA array having the fluorescent cDN A hybridized therewith was measured 
using ScanArray 4000 (manufactured by GSI Lumonics). 

[0435] Table 5 shows the Cy3 and Cy5 signal intensities of the genes having been corrected on the basis of the data 
of the rabbit globin used as the internal standard and the Cy3/Cy5 ratios. 



Table 5 



SEQ ID NO 


Cy3 intensity 


Cy5 intensity 


Cy3/Cy5 


207 


5248 


3240 


1.62 
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Table 5 (continued) 
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uyj iiiidioiiy 
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3433 
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^oyo 
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3435 
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O A OO 
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D I OH 
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O400 
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1493 


0.87 




1 1 DO 


1131 
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n 74 


3451 
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jooy 


u. / ** 


3453 


349o 
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3455 


1 491 


1 1 44 


1 . JU 


1743 


1972 


1841 


1 .U/ 


3470 


4752 


o7b4 


1 Oft 


2132 


11 73 


lUoo 


i ,Uo 


3476 


lo47 


1 4^U 


1 QA 
I .JU 


3477 


i^o4 


1 1 o4 


1 . 1 u 


3485 


4ooy 


oUl 4 


O ^7 


3488 


34289 


1398 


24.52 


3489 


43645 


1497 


29,16 


3494 


3199 


2503 


1.28 


3496 


3428 


2364 


1.45 


3497 


3848 


3358 


1.15 



[0436] The ORF function data estimated by using software were searched for SEQ ID NOS:3488 and 3489 showing 
remarkably strong Cy3 signals. As a result, it was found that SEQ ID NOS:3488 and 3489 are a maleate synthase 
gene and an isocitrate lyase gene, respectively. It is known that these genes are transcriptionally induced by acetic 
acid in Corynebacterium gfutamicum (Archives of Microbiology, 168. 262-269 (1997)). 

[0437] As described above, a gene of which expression is fluctuates could be discovered by synthesizing appropriate 
oligo DNA primers based on the ORF nucleotide sequence information deduced from the full genomic nucleotide 
sequence information of Corynebacterium gfutamicum ATCC 1 3032 using software, amplifying the nucleotide sequenc- 
es of the gene using the genome DNA of Corynebacterium giutamicum as a template in the PCR reaction, and thus 
producing and using a DNA microarray. 

[0438] This Example shows that the expression amount can be analyzed using a DNA microarray in the 24 genes. 
On the other hand, the present DNA microarray techniques make it possible to prepare DNA microarrays having thereon 
several thousand gene probes at once. Accordingly, it is also possible to prepare DNA microarrays having thereon all 
of the ORF gene probes deduced from the full genomic nucleotide sequence of Corynebacterium giutamicum ATCC 
1 3032 determined by the present invention, and analyze the expression profile at the total gene level of Corynebac- 
terium giutamicum using these arrays. 

Example 5 

Homology search using Corynebacterium giutamicum genome sequence 
(1) Search of adenosine deaminase 

[0439] The amino acid sequence (ADD_ECOLI) of Escherichia coll adenosine deaminase was obtained from Swiss- 
prot Database as the amino acid sequence of the protein of which function had been confirmed as adenosine deaminase 
(EC3.5.4.4). By using the full length of this amino acid sequence as a query, a homology search was carried out on a 
nucleotide sequence database of the genome sequence of Corynebacterium giutamicum or a database of the amino 
acids in the ORF region deduced from the genome sequence using FASTA program (Proc. Natl. Acad. Sci. ISA, 85: 
2444-2448 (1 988)). A case where E-value was le* 10 or less was judged as being significantly homologous. As a result, 
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no sequence significantly homologous with the Escherichia coli adenosine deaminase was found in the nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or the database of the amino acid se- 
quences in the ORF region deduced from the genome sequence. Based on these results, it is assumed that Coryne- 
bacterium glutamicum contains no ORF having adenosine deaminase activity and thus has no activity of converting 
adenosine into inosine. 

(2) Search of glycine cleavage enzyme 

[0440] The sequences (GCSP_ECOLI, GCST_ECOLI and GCSH_ECOLI) of glycine decarboxylase, aminomethyl 
transferase and an aminomethyl group carrier each of which is a component of Escherichia coli glycine cleavage 
enzyme as the amino acid sequence of the protein, of which function had been confirmed as glycine cleavage enzyme 
(EC2 1.2.10), were obtained from Swiss-prot Database. 

[0441 ] By using these full-length amino acid sequences as a query, a homology search was carried out on a nucleotide 
sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF amino acid 
sequences deduced from the genome sequence using FASTA program. A case where E-value was le-'° or less was 
judged as being significantly homologous. As a result, no sequence significantly homologous with the glycine decar- 
boxylase the aminomethyl transferase or the aminomethyl group carrier each of which is a component of Escherichia 
coli glycine cleavage enzyme, was found in the nucleotide sequence database of the genome sequence of Coryne- 
bacterium glutamicum or the database of the ORF amino acid sequences estimated from the genome sequence. Based 
on these results, it is assumed that Corynebacterium glutamicum contains no ORF having the activity of glycine de- 
carboxylase, aminomethyl transferase or the aminomethyl group carrier and thus has no activity of the glycine cleavage 
enzyme. 

(3) Search of IMP dehydrogenase 

[0442] The amino acid sequence (IMDH ECOLI) of Escherichia coli IMP dehydrogenase as the amino acid sequence 
of the protein, of which function had been confirmed as IMP dehydrogenase (EC1.1.1 .205). was obtained from Swiss- 
prot Database. By using the full length of this amino acid sequence as a query, a homology search was carried out on 
a nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF 
amino acid sequences predicted from the genome sequence using FASTA program. A case where E-value was ter 
or less was judged as being significantly homologous. As a result, the amino acid sequences encoded by two ORFs. 
namely an ORF positioned in the region of the nucleotide sequence No. 61 5336 to 61 6853 (or ORF having the nucle- 
otide sequence represented by SEQ ID NO:672) and another ORF positioned in the region of the nucleotide sequence 
No 616973 to 618094 (or ORF having the nucleotide sequence represented by SEQ ID NO:674) were significantly 
homologous with the ORFs of Escherichia coli IMP dehydrogenase. By using the above-described predicted amino 
acid sequence as a query in order to examine the similarity of the amino acid sequences encoded by the ORFs with 
IMP dehydrogenases of other organisms in greater detail, a search was carried out on GenBank (http://www.ncbi.nlm. 
nih gov/) rir-aa database (amino acid sequence database constructed on the basis of GenBankCDS translation prod- 
ucts PDB database, Swiss-Prot database, PIR database, PRF database by eliminating duplicated registrations) using 
BLAST program As a result, both of the two amino acid sequences showed significant homologies with IMP dehdy- 
rogenases of other organisms and clearly higher homologies with IMP dehdy rogenases than with amino acid sequences 
of other proteins, and thus, it was assumed that the two ORFs would function as IMP dehydrogenase. Based on these 
results, it was therefore assumed that Corynebacterium glutamicum has two ORFs having the IMP dehydrogenase 
activity. 

Example 6 

Proteome analysis of proteins derived from Corynebacterium glutamicum 

(1 ) Preparations of proteins derived from Corynebacterium glutamicum ATCC 1 3032, FERM BP-7134 and FERM BP- 
158 

[0443] Culturing tests of Corynebacterium glutamicum ATCC 1 3032 (wild type strain), Corynebacterium glutamicum 
FERM BP-7134 (lysine-producing strain) and Corynebacterium glutamicum (FERM BP-158, lysine-highly producing 
strain) were carried out in a 5 1 jar fermenter according to the method in Example 2(3). The results are shown in Table 6. 
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Table 6 



Strain 


L-Lysine yield (g/l) 


ATCC 13032 


0 


FERM BP-7134 


45 


FERM BP-158 


60 



[0444] After culturing. cells of each strain were recovered by centrifugation. These cells were washed with Tns-HC 
buffer (1 0 mmol/l Tris-HCI, pH 6.5, 1 .6 mg/ml protease inhibitor (COMPLETE; manufactured by Boehringer Mannheim)) 
three times to give washed cells which could be stored under freezing at -80°C. The freeze-stored cells were thawed 
before use, and used as washed cells. ,_,■»,, B </ 

[04451 The washed cells described above were suspended in a disruption buffer (10 mmol/1 Tris-HCI, pH 7.4, 5 mmol/ 
I magnesium chloride, 50 mg/l RNase, 1 .6 mg/ml protease inhibitor (COMPLETE: manufactured by Boehnnger Man- 
nheim)) and disrupted with a disrupter (manufactured by Brown) under cooling. To the resulting disruption solution. 
DNase was added to give a concentration of 50 mg/l, and allowed to stand on ice for 10 minutes. The solution was 
centrifuged (5,000 x g, 15 minutes, 4°C) to remove the undisrupted cells as the precipitate, and the supernatant was 
recovered 

[0446] To the supernatant, urea was added to give a concentration of 9 mol/l, and an equivalent amount of a lysis 
buffer (9 5 mol/l urea 2% NP-40, 2% Ampholine. 5% mercaptoethanol, 1.6 mg/ml protease inhibitor (COMPLETE; 
manufactured by Boehringer Mannheim) was added thereto, followed by thoroughly stirring at room temperature for 

dissolving. _, . . 

[0447] After being dissolved, the solution was centrifuged at 12,000 x g for 15 m.nutes, and the supernatant was 

recovered 

[0448] To the supernatant, ammonium sulfate was added to the extent of 80% saturation, followed by thoroughly 
stirring for dissolving. 

[04491 After being dissolved, the solution was centrifuged (16,000 x g, 20 minutes. 4°C), and the precipitate was 
recovered. This precipitate was dissolved in the lysis buffer again and used in the subsequent procedures as a protein 
sample. The protein concentration of this sample was determined by the method for quantifying protein of Bradford. 

(2) Separation of protein by two dimensional electrophoresis 

[0450] The first dimensional electrophoresis was carried out as described below by the isoelectric electrophoresis 

ra45lT A molded dry IPG strip gel (pH 4-7, 13 cm. Immobiline DryStrips; manufactured by Amersham Pharmacia 
Biotech) was set in an electrophoretic apparatus (Multiphor II or IPGphor; manufactured by Amersham Pharmacia 
Biotech) and a swelling solution (8 mol/l urea. 0.5% Triton X-100. 0.6% dithiothreitol. 0.5% Ampholine. pH 3-10) was 
packed therein, and the gel was allowed to stand for swelling 12 to 16 hours. „ UAOO « 0/ 

[0452] The protein sample prepared above was dissolved in a sample solution (9 mol/l urea. 2% CHAPS, 1 /» dithi- 
othreitol. 2% Ampholine, pH 3-10), and then about 100 to 500 ug (in terms of protein) portions thereof were taken and 
added to the swollen IPG strip gel. . ,„ r . 

[0453] The electrophoresis was carried out in the 4 steps as defined below under controlling the temperature to 20 c. 

step 1 : 1 hour under a gradient mode of 0 to 500V; 
step 2: 1 hour under a gradient mode of 500 to 1 ,000 V; 
step 3: 4 hours under a gradient mode of 1 ,000 to 8,000 V; and 
step 4; 1 hour at a constant voltage of 8,000 V. 

[0454] After the isoelectric electrophoresis, the IPG strip gel was put off from the holder and soaked in an equilibration 
buffer A (50 mmol/l Tris-HCI, pH 6.8. 30% glycerol. 1% SDS, 0.25% dithiothreitol) for 15 minutes and another equili- 
bration buffer B (50 mmol/l Tris-HCI, pH 6.8, 6 mol/l urea, 30% glycerol. 1% SDS, 0.45% iodo acetamide) for 1 5 m.nutes 
to sufficiently equilibrate the gel. A <0/ 

[0455] After the equilibrium, the IPG strip gel was lightly rinsed in an SDS electrophoresis buffer (1 .4 h glycine, 0.1 A 
SDS. 0.3% Tris-HCI. pH 8.5), and the second dimensional electrophoresis depending on molecular weight was earned 
out as described below to separate the proteins. 

[0456] Specifically, the above IPG strip gel was closely placed on 1 4% polyacrylamide slub gel (1 4% polyacrylamide, 
0.37% bisacrylamide, 37.5 mmol/l Tris-HCI, pH 8.8, 0.1% SDS, 0.1% TEMED, 0.1% ammonium persuifate) and sub- 
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jected to electrophoresis under a constant voltage of 30 mA at 20°C for 3 hours to separate the proteins. 



(3) Detection of protein spot 



5 [0457] Coomassie staining was performed by the method of Gorg et al. {Electrophoresis, &. 531-546 (1988)) for the 
slub gel after the second dimensional electrophoresis. Specifically, the slub gel was stained under shaking at 25°C for 
about 3 hours, the excessive coloration was removed with a decoloring solution, and the gel was thoroughly washed 
with distilled water. 

[0458] The results are shown in Fig. 2. The proteins derived from the ATCC 13032 strain (Fig. 2A) t FERM BP-7134 
10 strain (Fig. 2B) and FERM BP-158 strain (Fig. 2C) could be separated and detected as spots. 

(4) In-gel digestion of detected protein spot 

[0459] The detected spots were each cut out from the gel and transferred into siliconized tube, and 400 u.l of 100 
15 mmol/1 ammonium bicarbonate : acetonitrile solution (1:1, v/v) was added thereto, followed by shaking overnight and 
freeze-dried as such. To the dried gel, 10u.l of alysylendopeptidase(LysC) solution (manufactured by WAKO, prepared 
with 0.1% SDS-containing 50 mmol/l ammonium bicarbonate to give a concentration of 100 ng/uJ) was added and the 
gel was allowed to stand for swelling at 0°C for 45 minutes, and then allowed to stand at 37°C for 16 hours. After 
removing the LysC solution, 20 u.l of an extracting solution (a mixture of 60% acetonitrile and 5% formic acid) was 
20 added, followed by ultrasonication at room temperature for 5 minutes to disrupt the gel. After the disruption, the extract 
was recovered by centrifugation (12,000 rpm, 5 minutes, room temperature). This operation was repeated twice to 
recover the whole extract. The recovered extract was concentrated by centrifugation in vacuo to halve the liquid volume. 
To the concentrate, 20 u.l of 0.1% trifluoroacetic acid was added, followed by thoroughly stirring, and the mixture was 
subjected to desalting using ZipTlp (manufactured by Millipore). The protein absorbed on the carriers of ZipTip was 
25 eluted with 5 u.l of a-cyano-4-hydroxycinnamic acid for use as a sample solution for analysis. 

(5) Mass spectrometry and amino acid sequence analysis of protein spot with matrix assisted laser desorption ionization 
time of flight mass spectrometer (MALDI-TOFMS) 

30 [0460] The sample solution for analysis was mixed in the equivalent amount with a solution of a peptide mixture for 
mass calibration (300 nmol/l Angiotensin II, 300 nmol/l Neurotensin, 150 nmol/l ACTHcltp 18-39, 2.3 u.mol/1 bovine 
insulin B chain), and 1 u.l of the obtained solution was spotted on a stainless probe and crystallized by spontaneously 
drying. 

[0461] As measurement instruments, REFLEX MALDI-TOF mass spectrometer (manufactured by Bruker) and an 
35 N2 laser (337 nm) were used in combination. 

[0462] The analysis by PMF (peptide-mass finger printing) was carried out using integration spectra data obtained 
by measuring 30 times at an accelerated voltage of 19.0 kV and a detector voltage of 1.50 kV under reflector mode 
conditions. Mass calibration was carried out by the internal standard method. 

[0463] The PSD (post-source decay) analysis was carried out using integration spectra obtained by successively 
40 altering the reflection voltage and the detector voltage at an accelerated voltage of 27.5 kV. 

[0464] The masses and amino acid sequences of the peptide fragments derived from the protein spot after digestion 
were thus determined, 

(6) Identification of protein spot 

45 

[0465] From the amino acid sequence information of the digested peptide fragments derived from the protein spot 
obtained in the above (5), ORFs corresponding to the protein were searched on the genome sequence database of 
Corynebacterium glutamicum ATCC 13032 as constructed in Example 1 to identify the protein. 
[0466] The identification of the protein was carried out using MS-Fit program and MS-Tag program of intranet protein 
so prospector. 

(a) Search and identification of gene encoding high-expression protein 

[0467] In the proteins derived from Corynebacterium glutamicum ATCC 1 3032 showing high expression amounts in 
55 CBB-staining shown in Fig. 2A, the proteins corresponding to Spots-1 , 2, 3, 4 and 5 were identified by the above method. 
[0468] As a result, it was found that Spot-1 corresponded to enolase which was a protein having the amino acid 
sequence of SEQ ID NO:4585; Spot-2 corresponded to phosphoglycelate kinase which was a protein having the amino 
acid sequence of SEQ ID NO:5254; Spot-3 corresponded to glyceraldehyde-3-phosphate dehydrogenase which was 



f 
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a protein having the amino acid sequence represented by SEQ ID NO:5255; Spot-4 corresponded to fructose bis- 
phosphate aldolase which was a protein having the amino acid sequence represented by SEQ ID NO:6543; and Spot- 
5 corresponded to those phosphate isomerase which was a protein having the amino acid sequence represented by 
SEQ ID NO:5252. 

5 [0469] These genes, represented by SEQ ID NOS:1085, 1754, 1775, 3043 and 1752 encoding the proteins corre- 
sponding to Spots-1, 2, 3, 4 and 5, respectively, encoding the known proteins are important in the central metabolic 
pathway for maintaining the life of the microorganism. Particularly, it is suggested that the genes of Spots-2, 3 and 5 
form an operon and a high-expression promoter is encoded in the upstream thereof (J. of Eacteriol., 174: 6067-6086 
(1992)). 

w [0470] Also, the protein corresponding to Spot-9 in Fig. 2 was identified in the same manner as described above, 
and it was found that Spot-9 was an elongation factor Tu which was a protein having the amino acid sequence repre- 
sented by SEQ ID No:6937, and that the protein was encoded by DNA having the nucleotide sequence represented 
by SEQ ID No:3437. 

[0471] Based on these results, the proteins having high expression level were identified by proteome analysis using 
is the genome sequence database of Corynebacterium glutamicum constructed in Example 1 . Thus, the nucleotide se- 
quences of the genes encoding the proteins and the nucleotide sequences upstream thereof could be searched simul- 
taneously. Accordingly, it is shown that nucleotide sequences having a function as a high-expression promoter can be 
efficiently selected. 



20 (b) Search and identification of modified protein 

[0472] Among the proteins derived from Corynebacterium glutamicum FERM BP-7134 shown in Fig. 2B, Spots-6, 
7 and 8 were identified by the above method. As a result, these three spots all corresponded to catalase which was a 
protein having the amino acid sequence represented by SEQ ID NO:3785. 
25 [0473] Accordingly, all of Spots-6, 7 and 8 detected as spots differing in isoelectric mobility were alt products derived 
from a catalase gene having the nucleotide sequence represented by SEQ ID No:285. Accordingly, it is shown that 
the catalase derived from Corynebacterium glutamicum FERM BP-7134 was modified after the translation. 
[0474] Based on these results, it is confirmed that various modified proteins can be efficiently searched by proteome 
analysis using the genome sequence database of Corynebacterium glutamicum constructed in Example 1 . 

30 

(c) Search and identification of expressed protein effective in lysine production 

[0475] It was found out that in Fig. 2A (ATCC 13032: wild type strain), Fig. 2B (FERM BP-7134: lysine-producing 
strain) and Fig. 2C (FERM BP-158: lysine-highly producing strain), the catalase corresponding to Spot-8 and the elon- 
35 gation factor Tu corresponding to Spot-9 as identified above showed the higher expression level with an increase in 
the lysine productivity. 

[0476] Based on these results, it was found that hopeful mutated proteins can be efficiently searched and identified 
in breeding aiming at strengthening the productivity of a target product by the proteome analysis using the genome 
sequence database of Corynebacterium glutamicum constructed in Example 1 . 
40 [0477] Moreover, useful mutation points of useful mutants can be easily specified by searching the nucleotide se- 
quences (nucleotide sequences of promoter, ORF, or the like) relating to the identified proteins using the above data- 
base and using primers designed on the basis of the sequences. As a result of the fact that the mutation points are 
specified, industrially useful mutants which have the useful mutations or other useful mutations derived therefrom can 
be easily bred. 

45 [0478] While the invention has been described in detail and with reference to specific embodiments thereof, it will 
be apparent to one of skill in the art that various changes and modifications can be made therein without departing 
from the spirit and scope thereof. All references cited herein are incorporated in their entirety. 



so Claims 

1 . A method for at least one of the following: 

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium, 
55 (B) measuring an expression amount of a gene derived from a coryneform bacterium, 

(C) analyzing an expression profile of a gene derived from a coryneform bacterium, 

(D) analyzing expression patterns of genes derived from a coryneform bacterium, or 

(E) identifying a gene homologous to a gene derived from a coryneform bacterium, 
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said method comprising: 

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected 
from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any 
one of SEQ ID NOS: 1 to 3501 , second polynucleotides which hybridize with the first polynucleotides under 
stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of 
the first or second polynucleotides, 

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a co- 
ryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a 
labeled polynucleotide to be examined, under hybridization conditions, 

(c) detecting any hybridization, and 

(d) analyzing the result of the hybridization. 

The method according to claim 1, wherein the coryneform bacterium is a microorganism belonging to the genus 
Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

The method according to claim 2, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetogiutamicum, Corynebacterium caltunae, Corynebacterium herculis, Corynebacterium litium, Corynebacteri- 
um meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

The method according to claim 1 , wherein the polynucleotide derived from a coryneform bacterium, the polynuce- 
lotide derived from a mutant of the coryneform bacterium or the polynucleotide to be examined is a gene relating 
to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid, and analogues thereof. 

The method according to claim 1 , wherein the polynucleotide to be examined is derived from Escherichia coll 
. A polynucleotide array, comprising: 

at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucle- 
otide sequence represented by any one of SEQ ID NOS:1 to 3501, second polynucleotides which hybridize 
with the first polynucleotides under stringent conditions, and third polynucleotides comprising 10 to 200 con- 
tinuous bases of the first or second polynucleotides, and 
a solid support adhered thereto. 

7. A polynucleotide comprising the nucleotide sequence represented by SEQ ID NO: 1 or a polynucleotide having a 
homology of at least 80% with the polynucleotide. 

8. A polynucleotide comprising any one of the nucleotide sequences represented by SEQ ID NOS:2 to 3431 , or a 
polynucleotide which hybridizes with the polynucleotide under stringent conditions. 

9. A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID 
NOS:3502 to 6931 , or a polynucleotide which hybridizes therewith under stringent conditions. 

1 0 A polynucleotide which is present in the 5' upstream or 3 ( downstream of a polynucleotide comprising the nucleotide 
sequence of any one of SEQ ID NOS:2 to 3431 in a whole polynucleotide comprising the nucleotide sequence 
represented by SEQ ID NO:1 , and has an activity of regulating an expression of the polynucleotide. 

1 1 A polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequence of the polynucleotide of any 
one of claims 7 to 10, or a polynucleotide comprising a nucleotide sequence complementary to the polynucleotide 
comprising 10 to 200 continuous based. 

12. A recombinant DNA comprising the polynucleotide of any one of claims 8 to 11 . 

13. A transformant comprising the polynucleotide of any one of claims 8 to 11 or the recombinant DNA of claim 12. 



14. A method for producing a polypeptide, comprising: 
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culturing the transformant of claim 13 in a medium to produce and accumulate a polypeptide encoded by the 
polynucleotide of claim 8 or 9 in the medium, and 
recovering the polypeptide from the medium. 

5 15. A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and 
analogues thereof, comprising: 

culturing the transformant of claim 13 in a medium to produce and accumulate at least one of an amino acid, 
a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and 
w recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, 

and analogues thereof from the medium. 

16. A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS:2 to 



15 



3431. 



17. A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931, 



18 The polypeptide according to claim 16 or 17, wherein at least one amino acid is deleted, replaced, inserted or 
added, said polypeptides having an activity which is substantially the same as that of the polypeptide without satd 
20 at least one amino acid deletion, replacement, insertion or addition. 

19. A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid sequence 
of the polypeptide of claim 1 6 or 1 7, and having an activity which is substantially the same as that of the polypeptide. 

25 20. An antibody which recognizes the polypeptide of any one of claims 16 to 19. 

21. A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of claims 1 6 to 1 9 and 
30 partial fragment polypeptides of the polypeptides, and 

a solid support adhered thereto. 

22. A polypeptide array, comprising: 

35 at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypep- 

tides of claims 16 to 19 and partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

23. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
40 form bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:1 
to 3501 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

45 (iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 

1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device 
for screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information; and 

(iv) an output device mat shows a screening or analyzing result obtained by the comparator. 

50 

24. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, target se- 
55 quence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with 
the target sequence or target structure motif information; and 
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(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

25. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , and target sequence or target structure motif information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage 
device for screening and analyzing amino acid sequence information which is coincident with or analogous to 
the target sequence or target structure motif information; and 

(iv) an output device that shows a screening or analyzing result obtained by the comparator. 

26. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryne- 
form bacterium, comprising the following: 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , and target 
sequence information or target structure motif information into a user input device; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 
with the target sequence or target structure motif information; and 

(iv) screening and analyzing amino acid sequence information which is coincident with or analogous to the 
target sequence or target structure motif information. 

27. A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having a 
target nucleotide sequence derived from a coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:2 
to 3501, function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide 
sequence information; 

(ii) a data storage device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS: 
2 to 3501 with the target nucleotide sequence information for determining a function of a polypeptide encoded 
by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the poly- 
nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501 ; and 

(iv) an output devices that shows a function obtained by the comparator. 

28. A method based on a computer for determining a function of a polypeptide encoded by a polypeptide encoded by 
a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the fol- 
lowing: 

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501, function in- 
formation of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 with 
the target nucleotide sequence information; and 

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence 
which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected 
from SEQ ID NOS:2 to 3501 . 

29. A system based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 , function information based on the amino acid sequence, and target amino acid sequence infor- 
mation; 
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(ii) a data storing device for at least temporarily storing the input information; 

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS: 
3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide 
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least 

5 one amino acid sequence selected from SEQ ID NOS:3502 to 7001 ; and 

(iv) an output device that shows a function obtained by the comparator. 

30. A method based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 

w 

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 , function 
information based on the amino acid sequence, and target amino acid sequence information; 

(ii) at least temporarily storing said information; 

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS; 3502 to 7001 
15 with the target amino acid sequence information; and 

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or 
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to 
7001. 

20 31 . The system according to any one of claims 23, 25, 27 and 29, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

32. The method according to any one of claims 24, 26, 28 and 30, wherein a coryneform bacterium is a microorganism 
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

25 

33. The system according to claim 31 , wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium caliunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacteri- 
um melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

30 

34. The method according to claim 32, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 
acetoglutamicum, Corynebacterium caliunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacteri- 
um melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

35 

35. A recording medium or storage device which is readable by a computer in which at least one nucleotide sequence 
information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide sequence is 
recorded, and is usable in the system of claim 23 or 27 or the method of claim 24 or 28. 

to 36. A recording medium or storage device which is readable by a computer in which at least one amino acid sequence 
information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence 
is recorded, and is usable in the system of claim 25 or 29 or the method of claim 26 or 30. 

37. The recording medium or storage device according to claim 35 or 36, which is a computer readable recording 
45 medium selected from the group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory 

(RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM 
and DVD-RW. 

38. A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Val 
so residue at the 59th in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium 

is replaced with an amino acid residue other than a Val residue. 

39. A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid 
sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue. 

55 

40. The polypeptide according to claim 38 or 39, wherein the Val residue at the 59th position is replaced with an Ala 
residue. 
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41. A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro residue 
at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform bacterium is 
replaced with an amino acid residue other than a Pro residue. 

5 42. A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino acid 
sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue. 

43. The polypeptide according to claim 41 or 42, wherein the Pro residue at the 458th position is replaced with a Ser 
residue. 

w 

44. The polypeptide according to any one of claims 38 to 43, which is derived from Corynebacterium giutamicum. 

45. A DNA encoding the polypeptide of any one of claims 38 to 44. 
'5 46. A recombinant DNA comprising the DNA of claim 45. 

47. A transformant comprising the recombinant DNA of claim 46. 

48. A transformant comprising in its chromosome the DNA of claim 45. 

20 

49. The transformant according to claim 47 or 48, which is derived from a coryneform bacterium. 

50. The transformant according to claim 49, which is derived from Corynebacterium giutamicum. 

25 51. A method for producing L-lysine, comprising: 

culturing the transformant of any one of claims 47 to 50 in a medium to produce and accumulate L-lysine in 
the medium, and 

recovering the L-lysine from the culture. 

30 

52. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:1 to 3431 , comprising the following: 



(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
35 rium which has been subjected to mutation breeding so as to produce at least one compound selected from 

an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431; 

(ii) identifying a mutation point present in the production strain based on a result obtained by (i); 

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point, or deleting 
*o the mutation point from a coryneform bacterium having the mutation point; and 

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 



53. The method according to claim 52, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
45 a signal transmission pathway. 

54. The method according to claim 52, wherein the mutation point is a mutation point relating to a useful mutation 
which improves or stabilizes the productivity. 

50 55. A method for breading a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:1 to 3431, comprising: 

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived a coryneform bacte- 
rium which has been subjected to mutation breeding so as to produce at least one compound selected from 

55 an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof by a fermentation 

method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431 ; 

(ii) identifying a mutation point present in the production strain based on a result obtain by (i); 

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and 
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(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform 
bacterium obtained in (iii). 

56. The method according to claim 55, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or 
5 a signal transmission pathway. 

57. The method according to claim 55, wherein the mutation point is a mutation point which decreases or destabilizes 
the productivity. 

10 58. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:2 to 3431 , comprising the following: 



(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a 
nucleic acid, a vitamin, a saccharide, an organic acid, and analogous thereof, based on the nucleotide se- 

*5 quence information represented by SEQ ID NOS:2 to 3431; 

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity; 

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and 

(iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bac- 
terium which have been transformed with the gene obtained in (iii). 

20 

59. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ 
ID NOS:2 to 3431 , comprising the following: 



(i) arranging a function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431 ; 
25 (ji) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission 

pathway; 

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium 
in combination with information relating known biosynthesis pathway or signal transmission pathway of a co- 
ryneform bacterium; 

30 (jv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and 

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either 
strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or 
weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv). 



35 60. A coryneform bacterium, bred by the method of any one of claims 52 to 59. 



61. The coryneform bacterium according to claim 60, which is a microorganism belonging to the genus Corynebacte- 
rium, the genus Brevibacterium, or the genus Microbacterium. 



^0 62. The coryneform bacterium according to claim 61 , wherein the microorganism belonging to the genus Corynebac- 
terium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, 
Corynebacterium acetog/utamicum, Corynebacterium callunae, Corynebacterium herculis, corynebacterium //'/- 
ium, Corynebacterium melassecola, Corynebacterium thermoamino genes, and Corynebacterium ammonia 
genes. 

45 

63. A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, 
an organic acid and an analogue thereof, comprising: 

culturing a coryneform bacterium of any one of claims 60 to 62 in a medium to produce and accumulate at 
50 least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, 

and analogues thereof; 
recovering the compound from the culture. 

64. The method according to claim 63, wherein the compound is L-lysine. 

55 

65. A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the following: 



(i) preparing 
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a protein derived from a bacterium of a production strain of a coryneform bacterium which has been sub- 
jected to mutation breeding by a fermentation process so as to produce at least one compound selected 
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and 
a protein derived from a bacterium of a parent strain of the production strain; 

5 

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis; 

(iii) detecting the separated proteins, and comparing an expression amount of the protein derived from the 
production strain with that derived from the parent strain; 

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase 
w to extract peptide fragments; 

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and 

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ 
ID NOS:3502 to 7001 to identifying the protein having the amino acid sequences. 

J5 66. The method according to claim 65, wherein the coryneform bacterium is a microorganism belonging to the genus 
corynebacterium, the genus Brevibacterium, or the genus Microbactehum. 

67. The method according to claim 66, wherein the microorganism belonging to the genus Corynebacterium is selected 
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium 

20 acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium iilium, Corynebacteri- 

um meiassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes. 

68. A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382) . 

25 
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